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RESULT 1 
AAB60699 

ID AAB60699 standard; protein; 302 AA. 
XX 

AC AAB60699; 
XX 

DT ll-SEP-2003 (revised) 

DT 22-MAY-2001 (first entry) 

XX 

DE Mouse IgG signal/human BAFF-R/ human IgG Fc fusion protein, BAFF-R-Fc. 
XX 

KW Human BAFF-R; BAFF receptor; TNF family; immunoregulatory agent; 

KW immune-related disorder; B-cell growth inhibitor; 

KW B-cell maturation inhibitor; immunoglobulin production inhibitor; 

KW autoimmune disorder; B-cell lymphoprolif erative disorder; hypertension; 

KW renal disorder; immunosuppressive disorder; HIV infection; 



KW organ transplantation; antiinflammatory; systemic lupus erythematosus; 

KW autoimmune haemolytic anaemia; Grave's disease; multiple myeloma; 

KW B-cell carcinoma; leukaemia; rapidly progressive glomerulonephritis; 

KW lymphoma; gene therapy; cancer; tumour; IgG Fc; fusion construct. 
XX 

OS Homo sapiens. 

OS Mus sp . 

OS Chimeric. 
XX 

PN WO200112812-A2. 
XX 

PD 22-FEB-2001. 
XX 

PF 16-AUG-2000; 2 00 0WO-US0225 07 . 
XX 

PR 17-AUG-1999; 99US- 014 937 8 P . 

PR ll-FEB-2000; 2000US-0181684P . 

PR 18-FEB-2000; 2000US-0183536P . 
XX 

PA (BIO J ) BIOGEN INC. 

PA (APOT-) APOTECH R&D SA. 

XX 

PI Mackay F, Browning J, Ambrose C, Tschopp J, Schneider P; 

PI Thompson J; 

XX 

DR WPI; 2001-202866/20. 

DR N-PSDB; AAF59999. 
XX 

PT Inhibiting dendritic cell-induced B-cell growth, maturation and B-cell 

PT lympho-prolif erative disorder by administering BAFF-receptor polypeptide, 

PT chimeric molecule comprising receptor or anti-BAFF-R antibody homolog. 
XX 

PS Example 4; Fig 2; 59pp; English. 
XX 

CC The invention relates to the use of a BAFF receptor (BAFF-R, also known 

CC as BCMA) protein, or a BAFF-R fusion protein as an agent for the 

CC treatment of a variety of immune-related disorders. BAFF-R is a member of 

CC the TNF (tumour necrosis factor) family, acting as an immunoregulatory 

CC agent, and also plays a role in the development of hypertension and 

CC related disorders. BAFF-R, fusion proteins containing it, and BAFF-R- 

CC specific antibodies can be used for inhibiting B-cell growth, dendritic 

CC cell-induced B-cell growth and maturation, and immunoglobulin production, 

CC and in the treatment of autoimmune disorders, B-cell lymphoprolif erative 

CC disorders, hypertension and renal disorders. The BAFF-R proteins may also 

CC be used in the treatment of immunosuppressive disorders and HIV 

CC infection, and in patients undergoing organ transplantation. The BAFF-R 

CC proteins or BAFF-R specific antibodies may be used for treating, 

CC suppressing or altering an immune response involving a signalling pathway 

CC between BAFF-R and BAFF, thereby inhibiting inflammation. Since BAFF-R 

CC inhibits B-cell growth and maturation it is useful for treating diseases 

CC such as systemic lupus erythematosus, autoimmune haemolytic anaemia, 

CC Grave's disease, multiple myeloma, B-cell carcinomas, leukaemia, rapidly 

CC progressive glomerulonephritis, and lymphomas. Nucleic acids encoding 

CC human BAFF-R may be used in gene therapy to treat tumours, lymphomas, 

CC autoimmune disorders and inherited B-cell-associated disorders. The 

CC present sequence represents the BAFF-R fusion protein BAFF-R-Fc, 

CC comprising a mouse IgG-kappa signal sequence, residues 1-153 of human 



CC BAFF-R and a human IgG Fc sequence. (Updated on ll-SEP-2003 to 

CC standardise OS field) 

XX 

SQ Sequence 302 AA; 

Query Match 100.0%; Score 1643; DB 4; Length 302; 

Best Local Similarity 100.0%; Pred. No. 1.4e-lll; 

Matches 302; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 1 METDTLLLWVLLLWVPGSTGDVTMLQMAGQCSQNEYFDSLLHACIPCQLRCSSNTPPLTC 60 

I I I I I I I I I I I I II I II I I I I I I I I I I I I I I I I II I I I I II I I I I I I I I I I I I I M I I I I 

Db 1 METDTLLLWLLLWVPGSTGDVTMLQMAGQCSQNEYFDSLLHACIPCQLRCSSNTPPLTC 60 

Qy 61 QRYCNASVTNSVKGVDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVWD 12 0 

I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 QRYCNASVTNSVKGVDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVWD 12 0 

Qy 121 VSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSN 18 0 

I I I I I I I M I I M I I II I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I 
Db 121 VSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSN 18 0 

Qy 181 KALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNG 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 181 KALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNG 240 

Qy 241 QPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSP 300 

I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 241 QPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSP 300 

Qy 301 GK 302 

I I 

Db 301 GK 302 



RESULT 2 
AAE00507 

ID AAE00507 standard; protein; 302 AA. 
XX 

AC AAE00507; 
XX 

DT ll-SEP-2003 (revised) 

DT 31-JUL-2001 (first entry) 

XX 

DE Human BCMA- Immunoglobulin G Fc region fusion construct. 
XX 

KW Human; A Proliferation Inducing Ligand Receptor; APRIL- R; cytostatic; 

KW gene therapy; cancer; nephrotropic; renal disorder; autoimmune disease; 

KW carcinoma; lung; colon; breast; prostate; Grave 1 s disease; hypertension; 

KW systemic lupus erythematosus; SLE; inflammation; cardiovascular disease; 

KW B-cell lympho-prolif erative disorder; BCM; immunosuppressive disease; 

KW organ transplantation; HIV; human immunodeficiency virus; TNF; murine; 

KW tumour necrosis factor; B cell maturation protein; BCMA; fusion protein; 

KW immunoglobulin G; IgG; Fc region. 
XX 

OS Homo sapiens. 

OS Mus sp. 

OS Chimeric. 



XX 

FH Key Location/Qualif iers 

FT Protein 1. .22 

FT /label= Signal_peptide 

FT /note= "Derived from murine Ig kappa sequence" 

FT Protein 23. .302 

FT /label= Mature_human_BCMA_IgG_Fc_f usion_protein 

FT Region 23. .75 ~~ 

FT /note= "Derived from human BCMA protein" 

FT Domain 24. .302 

FT /label= Cys teine_rich_domain 

FT /note= "Derived from human BCMA" 

FT Region 76. .302 

FT /note= "Derived from human IgG Fc region" 
XX 

PN WO200124811-A1. 
XX 

PD 12-APR-2001. 
XX 

PF 05-OCT-2000; 2 000WO-US027579 . 
XX 

PR 06-OCT-1999; 99US-0157 933P . 

PR ll-FEB-2000; 2000US-018 18 07P . 

PR 30-JUN-2000; 2 000US-021568 8P . 
XX 

PA (BIO J ) BIOGEN INC. 

PA (APOT-) APOTECH R&D SA. 

XX 

PI Schneider P, Thompson J, Cachero T, Ambrose C, Rennert P; 
XX 

DR WPI; 2001-266242/27. 

DR N-PSDB; AAD03847. 
XX 

PT Treating a mammal for a condition associated with undesired cell 

PT proliferation such as cancer or carcinoma, comprises administering a 

PT composition comprising A Proliferation Inducing Ligand Receptor (APRIL-R) 

PT antagonist. 

XX 

PS Example 1; Fig 3B; 85pp; English. 
XX 

CC The invention relates to a method of treating a mammal for a condition 

CC associated with undesired cell proliferation such as cancer or carcinoma. 

CC The method involves administering a composition comprising A 

CC Proliferation Inducing Ligand Receptor (APRIL-R) also referred as B cell 

CC maturation protein (BCM or BCMA) antagonist that antagonises the 

CC interaction between APRIL and its cognate receptor (s). This method is 

CC useful for treating undesired cell proliferation such as cancer or 

CC carcinoma e.g. human lung carcinoma, colon carcinoma, breast carcinoma, 

CC prostate carcinoma, and other carcinomas whose proliferation is modulated 

CC by APRIL. It is also useful for treating autoimmune diseases (Grave's 

CC disease, systemic lupus erythematosus-SLE) ; hypertension, cardiovascular 

CC diseases, renal disorders, B-cell lympho-prolif erative disorders, 

CC immunosuppressive diseases, organ transplantation, inflammation and human 

CC immunodeficiency virus (HIV) , and for treating, suppressing or altering 

CC an immune response involving a signalling pathway between APRIL-R and its 

CC ligand. APRIL-R DNA is also useful in gene therapy. The present sequence 

CC is a fusion construct containing human APRIL-R also referred as BCMA or 



CC BCM protein, Fc region of human iinmunoglobulin G (IgG) and a signal 

CC sequence from murine Ig kappa cDNA, (Updated on ll-SEP-2 003 to 

CC standardise OS field) 
XX 

SQ Sequence 302 AA; 

Query Match 100.0%; Score 1643; DB 4; Length 302; 

Best Local Similarity 100.0%; Pred. No. 1.4e-lll; 

Matches 302; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 METDTLLLWVLLLWVPGSTGDVTMLQMAGQCSQNEYFDSLLHACIPCQLRCSSNTPPLTC 60 

I II I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I | | | | | | I I 
Db 1 METDTLLLWVLLLWVPGSTGDVTMLQMAGQCSQNEYFDSLLHACIPCQLRCSSNTPPLTC 60 

Qy 61 QRYCNASVTNSVKGVDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVWD 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I II I I I I I I I I I I I I I I I I II I 
Db 61 QRYCNASVTNSVKGVDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVWD 120 

Qy 121 VSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSN 180 

I I I I I I I I I I M I I I I I I I I I I II I I I I I I I I I I I I I M I I I I I | I I | | | | | M | | | | | | 
Db 121 VSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSN 18 0 

Qy 181 KALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNG 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 181 KALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNG 240 

Qy 241 QPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSP 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 241 QPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSP 300 

Qy 301 GK 302 

I I 

Db 301 GK 302 



RESULT 3 
AAE15488 

ID AAE15488 standard; protein; 283 AA. 
XX 

AC AAE15488; 
XX 

DT 12-MAR-2002 (first entry) 
XX 

DE Human BCMA- immunoglobulin Fc region fusion protein. 
XX 

KW Human; transmembrane activator and intracellular CAML interactor; TACI; 

KW cytostatic; B cell maturation protein; BCMA; tumour necrosis factor; TNF; 

KW lymphoprolif erative disorder; tumour; lung; gastrointestinal; pancreatic; 

KW prostate; inflammation; immune disorder; diarrhoea; psoriasis; colitis; 

KW drug allergy; dermatitis; pneumonia; asthma; inflammatory bowel disease; 

KW Crohn 1 s disease; scleroderma; autoimmune disease; multiple sclerosis; 

KW human immunodeficiency virus; HIV; systemic lupus erythematosus; cancer; 

KW rheumatoid arthritis; atherosclerosis; fusion protein. 
XX 

OS Homo sapiens. 
XX 

PN WO200187979-A2. 



XX 

PD 22-NOV-2001. 
XX 

PF 14-MAY-2001; 2 0 01WO-US0155 67 . 
XX 

PR 12-MAY-2000; 2000US-0204039P . 

PR 27-JUN-2000; 2 000US-0214591P . 

PR 14-MAY-2001; 2 0 0 1US- 002 14 591 . 
XX 

PA (AMGE-) AMGEN INC. 
XX 

PI Theill LE, Yu G; 
XX 

DR WPI; 2002-066686/09. 
XX 

PT Inhibiting activity of B cell maturation protein and/or transmembrane 

PT activator and intracellular cyclophilin ligand interactor, by 

PT administering a binding partner for APRIL, a tumor necrosis factor family 

PT ligand. 

XX 

PS Disclosure; Fig 10B; 94pp; English. 
XX 

CC The invention relates to a method for inhibiting TACI (transmembrane 

CC activator and intracellular CAML interactor) and/or B cell maturation 

CC protein (BCMA) activity in a mammal. The method comprises administering a 

CC specific binding partner for APRIL (G70, a tumour necrosis factor-TNF 

CC family ligand) , having the consensus region of TACI, BCMA, or the TACI/ 

CC BCMA extracellular consensus sequence, but not the extracellular region 

CC of TACI or BCMA. The method is useful for inhibiting activity of TACI 

CC and/or BCMA in a mammal which is useful for treating B-cell or T-cell 

CC lymphoprolif erative disorders, one or more solid tumours such as lung, 

CC gastrointestinal, pancreatic or prostate tumour. APRIL, BCMA and TACI 

CC antagonists are useful for treating inflammation and immune function 

CC diseases such as diarrhoea, psoriasis, allergies, pneumonia, atopic 

CC dermatitis, respiratory allergic disease (asthma, hypersensitivity lung 

CC disease) , drug and insect sting allergy, inflammatory bowel disease 

CC (Crohn's disease, colitis), scleroderma, autoimmune disease (multiple 

CC sclerosis, rheumatoid arthritis, systemic lupus erythematosus), fungal, 

CC bacterial, protozoal and viral infections (HIV), atherosclerosis, cancer 

CC with leucocyte infiltration of the skin or organs. The present sequence 

CC is human BCMA protein-immunoglobulin Fc region fusion protein 
XX 

SQ Sequence 283 AA; 

Query Match 90.6%; Score 1488.5; DB 5; Length 283; 
Best Local Similarity 97.2%; Pred. No. 2.3e-100; 

Matches 27 5; Conservative 0; Mismatches 1; Indels 7; Gaps 1; 

Qy 27 MAGQCSQNEYFDSLLHACIPCQLRCSSNTPPLTCQRYCNASVTNSVKGV DKTH 79 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I II I I I I I I I I I I I 

Db 1 MAGQCSQNEYFDSLLHACIPCQLRCSSNTPPLTCQRYCNASVTNSVKGTNAGGGGGDKTH 60 

Qy 80 TCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVWDVSHEDPEVKFNWYVDGVEV 139 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I 

Db 61 TCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVWDVSHEDPEVKFNWYVDGVEV 12 0 



Qy 140 HNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPR 199 





1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 [ 1 1 1 II 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

M 1 1 II II M M 1 1 1 1 II 1 1 1 II M M 1 II 1 1 1 1 1 1 II 1 II 1 1 II 1 1 1 1 1 1 | 1 1 1 M 1 [ 1 


Db 


121 HNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPR 180 


Qy 


200 EPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSF 2 59 




1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 


Db 


181 EPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSF 2 40 


Qy 


2 60 FLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 302 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 


Db 


241 FLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 28 3 


RESULT 4 


AAE15489 


ID 


AAE15489 standard; protein; 281 AA. 


XX 




AC 


AAE15489; 


XX 




DT 


2 9 -AUG- 2 003 ( r e vi s ed ) 


DT 


12-MAR-2002 (first entry) 


XX 




DE 


Mouse BCMA-human immunoglobulin Fc region fusion protein. 


XX 




KW 


Human; transmembrane activator and intracellular CAML interactor; TACI; 


KW 


cytostatic; B cell maturation protein; BCMA; tumour necrosis factor; TNF; 


KW 


lymphoprolif erative disorder; tumour; lung; gastrointestinal; pancreatic; 


KW 


prostate; inflammation; immune disorder; diarrhoea; psoriasis; colitis; 


KW 


drug allergy; dermatitis; pneumonia; asthma; inflammatory bowel disease; 


KW 


Crohn 1 s disease; scleroderma; autoimmune disease; multiple sclerosis; 


KW 


human immunodeficiency virus; HIV; systemic lupus erythematosus; cancer; 


KW 


rheumatoid arthritis; atherosclerosis; fusion protein; mouse. 


XX 




OS 


Homo sapiens . 


OS 


Mus sp. 


OS 


Chimeric . 


XX 




PN 


WO200187979-A2. 


XX 




PD 


22-NOV-2001. 


XX 




PF 


14-MAY-2001; 2001WO-US015567 . 


XX 




PR 


12-MAY-2000; 2000US-0204039P . 


PR 


27-JUN-2000; 2000US-0214591P . 


PR 


14-MAY-2001; 2001US-00214 591 . 


XX 




PA 


(AMGE-) AMGEN INC. 


XX 




PI 


Theill LE, Yu G; 


XX 




DR 


WPI; 2002-066686/09. 


XX 




PT 


Inhibiting activity of B cell maturation protein and/or transmembrane 


PT 


activator and intracellular cyclophilin ligand interactor, by 


PT 


administering a binding partner for APRIL, a tumor necrosis factor family 


PT 


ligand . 


XX 





PS Disclosure; Fig 10B; 94pp; English. 
XX 

CC The invention relates to a method for inhibiting TACI (transmembrane 

CC activator and intracellular CAML interactor) and/or B cell maturation 

CC protein (BCMA) activity in a mammal. The method comprises administering a 

CC specific binding partner for APRIL (G70, a tumour necrosis factor-TNF 

CC family ligand) , having the consensus region of TACI, BCMA, or the TACI/ 

CC BCMA extracellular consensus sequence, but not the extracellular region 

CC of TACI or BCMA. The method is useful for inhibiting activity of TACI 

CC and/or BCMA in a mammal which is useful for treating B-cell or T-cell 

CC lymphoprolif erative disorders, one or more solid tumours such as lung, 

CC gastrointestinal, pancreatic or prostate tumour. APRIL, BCMA and TACI 

CC antagonists are useful for treating inflammation and immune function 

CC diseases such as diarrhoea, psoriasis, allergies, pneumonia, atopic 

CC dermatitis, respiratory allergic disease (asthma, hypersensitivity lung 

CC disease) , drug and insect sting allergy, inflammatory bowel disease 

CC (Crohn's disease, colitis), scleroderma, autoimmune disease (multiple 

CC sclerosis, rheumatoid arthritis, systemic lupus erythematosus), fungal, 

CC bacterial, protozoal and viral infections (HIV), atherosclerosis, cancer 

CC with leucocyte infiltration of the skin or organs. The present sequence 

CC is mouse BCMA protein-human immunoglobulin Fc region fusion protein. 

CC (Updated on 29-AUG-2003 to standardise OS field) 
XX 

SQ Sequence 281 AA; 

Query Match 85.1%; Score 1398.5; DB 5; Length 281; 

Best Local Similarity 92.2%; Pred. No. 8.3e-94; 

Matches 261; Conservative 4; Mismatches 9; Indels 9; Gaps 2 

Qy 27 MAGQCSQNEYFDSLLHACIPCQLRCSSNTPPLTCQRYCNASVTNSVKGV DKTH 7 9 

II I I : I I I I II I M I I I I M I : I I I I I I I : I I I : I I I I I I I I 

Db 1 MAQQCFHSEYFDSLLHACKPCHLRCSN — PPATCQPYCDPSVTSSVKGS YTGGGGGDKTH 58 

Qy 80 TCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVWDVSHEDPEVKFNWYVDGVEV 139 

I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I 
Db 59 TCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVWDVSHEDPEVKFNWYVDGVEV 118 

Qy 140 HNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPR 199 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I II I I I I I I I I I I I 
Db 119 HN7VKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPR 178 

Qy 200 EPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSF 259 

I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I II I I II II I I I I II I I I I I I I I I I I I I I I 
Db 179 EPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSF 238 

Qy 260 FLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 302 

I I I I II I I I I II I I I I I I II I I I I I I I II I I I I I I I I I I I I I I 
Db 239 FLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 2 81 



RESULT 5 
AA014133 

ID AA014133 standard; protein; 334 AA. 
XX 

AC AA014133; 
XX 

DT 29-AUG-2003 (revised) 



DT 02-MAY-2002 (first entry) 
XX 

DE Protein of hTACI ( 32-114 ) -IgGl Fc. 
XX 

KW Human transmembrane activator CAML interactor protein; TACI; cytostatic; 

KW cell proliferation; tumour; vulnerary; renal cell cancer; mastocytoma; 

KW Kaposi's sarcoma; breast; ovarian carcinoma; rectal; throat; melanoma; 

KW colon; bladder; mammary adenocarcinoma; gastrointestinal; hyperplasia; 

KW pharyngeal squamous cell; stomach; cellular hyperprolif eration; pannus; 

KW scleroderma; rheumatoid arthritis; scarring; mean survival time; liver; 

KW lung fibrosis; uterine; a proliferation inducing ligand; chimeric; 

KW hTACI (32-114) -IgGl Fc. 
XX 

OS Homo sapiens. 

OS Mus sp. 

OS Chimeric. 
XX 

FH Key 

FT Peptide 
FT 

FT Domain 
FT 
FT 

FT Region 
FT 
XX 

PN WO200181417-A2 
XX 

PD 01-NOV-2001. 
XX 

PF 27-APR-2001; 2 001WO-US 04 062 6 . 
XX 

PR 27-APR-2000; 2 000US-0199946P . 
XX 

PA (BIO J ) BIOGEN INC. 

PA (APOT-) APOTECH R&D SA. 

XX 

PI Ambrose C, Thompson J, Schneider P, Rennert P; 
XX 

DR WPI; 2002-062027/08. 

DR N-PSDB; AAK98729. 
XX 

PT Treating mammal for condition associated with undesired cell 

PT proliferation e.g., solid tumor or reducing solid tumor size located in 

PT mammal comprises administering transmembrane activator CAML interactor 

PT protein reagent . 

XX 

PS Example; Fig 4; 42pp; English. 
XX 

CC This sequence represents the protein of a human TACI extracellular domain 

CC with a truncated stalk region initiating after the second methionine 

CC fused to human IgG Fc (hTACI ( 32-114 ) -IgGl Fc) . The signal sequence from 

CC murine IgG-kappa was fused in frame with the human TACI extracellular 

CC domain. The invention relates to treating a mammal for a condition 

CC associated with undesired cell proliferation (e.g. a solid tumour, or 

CC reducing the size of a solid tumour located on or in a mammal) comprising 

CC administering a transmembrane activator CAML interactor protein (TACI) 



Location/ Qualifiers 
1. .22 

/note= "Signal peptide from murine IgG-kappa" 
23. .105 

/label= Extracellularjdomain 

/note= "Human TACI extracellular domain" 

106. .334 

/note= "hlgGl Fc sequence" 



CC reagent. The TACI reagent has cytostatic and vulnerary activity. Treating 

CC a mammal (e.g. human, cow, horse, dog, mouse, rat or cat) for a condition 

CC associated with undesired cell proliferation (e.g. cancer such as renal 

CC cell cancer, Kaposi f s sarcoma, breast cancer, sarcoma, ovarian carcinoma, 

CC rectal cancer, throat cancer, melanoma, colon cancer, bladder cancer, 

CC mastocytoma, lung cancer, mammary adenocarcinoma, pharyngeal squamous 

CC cell carcinoma, gastrointestinal cancer or stomach cancer) . The method is 

CC also useful for treating cellular hyperprolif eration (hyperplasia) such 

CC as scleroderma, pannus formation in rheumatoid arthritis, post-surgical 

CC scarring and lung, liver and uterine fibrosis. The TACI reagent of the 

CC invention can extend mean survival time of a mammal by 25% as compared to 

CC the mean survival time of a mammal in the absence of administering the 

CC TACI reagent. The TACI reagent also reduces the size of the tumour by 25% 

CC or more. (Updated on 29-AUG-2003 to standardise OS field) 
XX 

SQ Sequence 334 AA; 

Query Match 83.9%; Score 1379; DB 5; Length 334; 

Best Local Similarity 77.1%; Pred. No. 2.7e-92; 

Matches 262; Conservative 10; Mismatches 24; Indels 44; Gaps 3 

Qy 1 MET DTLLLWVLLLWVPGSTGDVTMLQMAGQCSQNEYFDSLLHACIPCQLRCS SNTP 56 

I I I I I I I I I I I I I I I I I I I I I I I : : I : I I I I : I : I : : 

Db 1 METDTLLLWVLLLWVPGSTGDVR SCPEEQYWDPLLGTCMSCKTICNHQSQRTCA 54 

Qy 57 PLTCQRYCNASVTNSVK-GVDKTHTCP 82 

I I : I : : I I I I I I I I I 

Db 55 AFCRSLSCRKEQGKFYDHLLRDCISCASICGQHPKQCAYFCENKLRSPVNLPVDKTHTCP 114 

Qy 83 PCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVWDVSHEDPEVKFNWYVDGVEVHNA 142 

I I I I I I I I I M I I I I I I I I I I I I II I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I 

Db 115 PCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVWDVSHEDPEVKFNWYVDGVEVHNA 174 

Qy 143 KTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQ 202 

M II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I II I 1 I I I I 
Db 175 KTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQ 234 

Qy 2 03 VYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLY 2 62 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I 

Db 235 VYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLY 2 94 

Qy 263 SKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 302 

I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I 
Db 295 SKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 334 



RESULT 6 
AA014132 

ID AA014132 standard; protein; 366 AA. 
XX 

AC AA014132; 
XX 

DT 29-AUG-2003 (revised) 

DT 02-MAY-2002 (first entry) 

XX 

DE Protein of hTACI ( 1-114) -IgGl . 
XX 



KW Human transmembrane activator CAML interactor protein; TACI ; cytostatic; 

KW cell proliferation; tumour; vulnerary; renal cell cancer; mastocytoma; 

KW Kaposi's sarcoma; breast; ovarian carcinoma; rectal; throat; melanoma; 

KW colon; bladder; mammary adenocarcinoma; gastrointestinal; hyperplasia; 

KW pharyngeal squamous cell; stomach; cellular hyperprolif eration; pannus; 

KW scleroderma; rheumatoid arthritis; scarring; mean survival time; liver; 

KW lung fibrosis; uterine; a proliferation inducing ligand; hlgGl Fc; IgGl; 

KW hTACI; chimeric. 
XX 

OS Homo sapiens. 

OS Mus sp . 

OS Chimeric . 
XX 

FH Key Location/Quali f iers 

FT Peptide 1. .23 

FT /note= "Signal peptide from murine IgG-kappa" 

FT Domain 24. .137 

FT /label= Extracellular_domain 

FT /note= "Human TACI extracellular domain" 

FT Region 138. .366 

FT /note= "hlgGl Fc sequence" 

XX 

PN WO200181417-A2. 
XX 

PD 01-NOV-2001. 
XX 

PF 27-APR-2001; 2001WO-US040626 . 
XX 

PR 27-APR-2000; 2 00 OUS-019 994 6P . 
XX 

PA (BIOJ ) BIOGEN INC. 

PA (APOT-) APOTECH R&D SA. 

XX 

PI Ambrose C, Thompson J, Schneider P, Rennert P; 
XX 

DR WPI; 2002-062027/08. 

DR N-PSDB; AAK98728. 
XX 

PT Treating mammal for condition associated with undesired cell 

PT proliferation e.g., solid tumor or reducing solid tumor size located in 

PT mammal comprises administering transmembrane activator CAML interactor 

PT protein reagent. 

XX 

PS Disclosure; Fig 3; 42pp; English. 
XX 

CC This sequence represents the protein of a human TACI extracellular domain 

CC with a truncated stalk region fused to human IgG Fc, called hTACI (1-114) 

CC -IgGl. The signal sequence from murine IgG-kappa was fused in frame with 

CC the human TACI extracellular domain. The invention relates to treating a 

CC mammal for a condition associated with undesired cell proliferation (e.g. 

CC a solid tumour, or reducing the size of a solid tumour located on or in a 

CC mammal) comprising administering a transmembrane activator CAML 

CC interactor protein (TACI) reagent. The TACI reagent has cytostatic and 

CC vulnerary activity. Treating a mammal (e.g. human, cow, horse, dog, 

CC mouse, rat or cat) for a condition associated with undesired cell 

CC proliferation (e.g. cancer such as renal cell cancer, Kaposi's sarcoma, 

CC breast cancer, sarcoma, ovarian carcinoma, rectal cancer, throat cancer, 



CC melanoma, colon cancer, bladder cancer, mastocytoma, lung cancer, mammary 

CC adenocarcinoma, pharyngeal squamous cell carcinoma, gastrointestinal 

CC cancer or stomach cancer) . The method is also useful for treating 

CC cellular hyperprolif eration (hyperplasia) such as scleroderma, pannus 

CC formation in rheumatoid arthritis, post-surgical scarring and lung, liver 

CC and uterine fibrosis. The TACI reagent of the invention can extend mean 

CC survival time of a mammal by 25% as compared to the mean survival time of 

CC a mammal in the absence of administering the TACI reagent. The TACI 

CC reagent also reduces the size of the tumour by 25% or more. (Updated on 

CC 29-AUG-2003 to standardise OS field) 

XX 

SQ Sequence 366 AA; 

Query Match 83.7%; Score 1376; DB 5; Length 366; 

Best Local Similarity 72.1%; Pred. No. 5e-92; 

Matches 264; Conservative 11; Mismatches 27; Indels 64; Gaps 3 

Qy 1 METDTLLLWVLLLWVPGSTGDVTMLQMA GQCSQN 34 

I I I I I II I I I I I I I I I I I I I I I I I : I : 

Db 1 METDTLLLWVLLLWVPGSTGDVTMSGLGRSRRGGRSRVDQEERFPQGLWTGVAMRSCPEE 60 

Qy 35 EYFDSLLHACIPCQLRCSSNTP P 57 

: I : I I I I : I : I : : I 
Db 61 QYWDPLLGTCMSCKTICNHQSQRTCAAFCRSLSCRKEQGKFYDHLLRDCISCASICGQHP 120 

Qy 58 LTCQRYCNASVTNSVK-GVDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTC 116 

I : I : : I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I 1 I I I I I 
Db 121 KQCAYFCENKLRSPVNLPVDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTC 180 

Qy 117 VWDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKC 176 

I I I II I I I I I I I I I I I I I E I I I I I I I I I II I I I I II I I I I I I II I I I I I I I I I I M I I II 
Db 181 VWDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKC 24 0 

Qy 177 KVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEW 236 

I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I II I I I I I I I I I I I II I I 
Db 241 KVSNKALPAPIEKTISKAKGQPREPQVTTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEW 300 

Qy 237 ESNGQPENNYKTT P PVLDSDGS FFLYS KLTVDKS RWQQGNVFS CS VMHEALHNH YTQKS L 296 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 301 ESNGQPENNYKTTP PVLDSDGS FFLYS KLTVDKS RWQQGNVFS CSVMHEAiHNH YTQKS L 360 

Qy 297 SLSPGK 302 

I I I I I I 

Db 361 SLSPGK 366 



RESULT 7 
AAE22245 

ID AAE22245 standard; protein; 320 AA. 
XX 

AC AAE22245; 
XX 

DT 29-AUG-2003 (revised) 

DT 25-JUL-2002 (first entry) 

XX 

DE Murine IgG-kappa signal sequence-human BAFF-R:Fc fusion protein. 
XX 



KW Human; BAFF receptor; BAFF-R; cytostatic; hypotensive; inflammation; TNF; 

KW Tumour Necrosis Factor; autoimmune disease; immunosuppressive; cancer; 

KW myasthenia gravis; hypertension; organ transplantation; drug screening; 

KW HIV; human immunodeficiency virus; genetic disorder; cardiovascular; 

KW renal; rheumatoid arthritis; systemic lupus erythematosus; amyloidosis; 

KW haemolytic anaemia; Chagas 1 disease; Grave's disease; glomerulonephritis; 

KW multiple myeloma; chromosomal mapping; tissue typing; drug screening; 

KW murine; immunoglobulin G; IgG; fusion protein. 
XX 

OS Homo sapiens . 

OS Mus mus cuius. 

OS Chimeric. 
XX 

FH Key Location/Qualifiers 

FT Peptide 1. .21 

FT /label= Signal_peptide 

FT /note= "Murine IgG-kappa signal sequence" 

FT Protein 22. .320 

FT /note= "Mature human BAFF-R: Fc protein" 

FT Misc-dif f erence 22 

FT /note= "Encoded by region introducing restriction site" 

FT Domain 23. .92 

FT /note= "Human BAFF-R extracellular domain" 

FT Misc-dif f erence 93 

FT /note= "Encoded by region introducing restriction site" 

FT Region 94. .320 

FT /note= "Human IgGl Fc region" 

XX 

PN WO200224909-A2 . 
XX 

PD 28-MAR-2002. 
XX 

PF 06-SEP-2001; 2 00 1WO-US02 8 006 . 
XX 

PR 18-SEP-2000; 2000US-0233152P . 

PR 21-SEP-2000; 2000US-0234 14 OP . 

PR 13-FEB-2001; 2 001US-0268499P . 

PR 14-AUG-2001; 2001US-0312185P . 
XX 

PA (BIO J ) BIOGEN INC. 
XX 

PI Ambrose CM, Thompson JS ; 
XX 

DR WPI; 2002-362428/39. 

DR N-PSDB; AAD35412. 
XX 

PT New human BAFF receptor proteins and nucleic acids, useful for treating, 

PT preventing or delaying e.g. autoimmune diseases, cancers, inherited 

PT genetic disorders involving B-cells, cardiovascular disorders, or renal 

PT disorders. 

XX 

PS Example 9; Fig 9; 164pp; English. 
XX 

CC The invention relates to human BAFF receptor (BAFF-R) nucleic acids and 

CC proteins. BAFF-R is a B-cell activating factor belonging to the Tumour 

CC Necrosis Factor (TNF) family, which is associated with the expression of 

CC B-cells and immunoglobulins. The BAFF-R proteins, DNA and antibodies are 



CC useful for treating, preventing or delaying autoimmune diseases, cancer, 

CC tumourigenic conditions or inherited genetic disorders involving B-cells, 

CC hypertension, cardiovascular disorders, immunosuppressive diseases, renal 

CC disorders, inflammation, organ transplantation and HIV. Autoimmune 

CC diseases, which can be treated or prevented by BAFF-R, include systemic 

CC lupus erythematosus, rheumatoid arthritis, myasthenia gravis, autoimmune 

CC haemolytic anaemia, idiopathic thrombocytopaenia purpura, Chagas 1 disease 

CC Grave's disease, anti-phospholipid syndrome, Wegener's granulomatosis, 

CC poly-arteritis nodosa and rapidly progressive glomerulonephritis. Plasma 

CC cells disorders e.g., multiple myeloma, Waldenstrom' s , macroglobulinaemia, 

CC heavy-chain disease, primary or immunocyte-associated amyloidosis, and 

CC monoclonal gammopathy of undetermined significance. The nucleic acids, 

CC protein, protein homologues, and antibodies may further be used in 

CC screening assays, in detection assays (chromosomal mapping, tissue typing 

CC or forensic biology), predictive medicine (e.g. diagnostic or prognostic 

CC assays, monitoring clinical trials, or pharmacogenomic) . The polypeptides 

CC are further useful as immunogens to raise anti-BFFR antibodies, or in 

CC screening drugs or compounds that modulate BAFF-R activity or expression. 

CC The present sequence is a fusion protein containing murine IgG-kappa 

CC signal sequence linked to human BAFF-R and human immunoglobulin G (IgG) 

CC Fc region. (Updated on 29-AUG-2003 to standardise OS field) 
XX 

SQ Sequence 320 AA; 

Query Match 82.8%; Score 1361; DB 5; Length 320; 

Best Local Similarity 82.2%; Pred. No. 5.2e-91; 

Matches 263; Conservative 5; Mismatches 34; Indels 18; Gaps 3 

METDTLLLWVLLLWVPGSTGDVTM— LQMAGQ CSQNEYFDSLLHACI PCQL-- 4 9 

I I I I I I I I I I I I I I I I II I I II : I : I MM: I : I I 

METDTLLLWVLLLWVPGSTGDVRRGPRSLRGRDAPAPTPCVPAECFDLLVRHCVACGLLR 60 

RCSSNTPPLTCQRYCNASVTNSVKGVDKTHTCPPCPAPELLGGPSVFLFPPKP 102 

M | | : I I I I I I I I I I I I I I I I II I I I I I I I I I I 

TPRPKPAGASSPAPRTALQPQESVGAGAGEAAVDKTHTCPPCPAPELLGGPSVFLFPPKP 12 0 

KDTLMISRTPEVTCVWDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRWSVLT 162 

I | | | | | | | | | I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I 
KDTLMISRTPEVTCWVDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRWSVLT 18 0 

VLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTC 222 

| | | || | | | | | I I I I I I II II I I I I I I I I I II II I I I II I I I I I I M I I I I I I I I I I I I I I 
VLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTC 24 0 

LVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSV 2 82 
I I M I I I I I I II I I I I I I I M I I I I II I II II I I M I I I I I I I I I I I I I I I I I I I I I M I 



I I I I I I II I I I I I I I I I M 



Qy 


1 


Db 


1 


Qy 


50 


Db 


61 


Qy 


103 


Db 


121 


Qy 


163 


Db 


181 


Qy 


223 


Db 


241 


Qy 


283 


Db 


301 



RESULT 8 
ABR55865 

ID ABR55865 standard; protein; 320 AA. 
XX 



AC ABR55865; 
XX 

DT 02-SEP-2003 (first entry) 
XX 

DE Human immunoglobulin G (IgG) Fc portion. 
XX 

KW Peptide remodeling; glycocon j ugation; glycosyltrans f erase ; glycan; 

KW immunoglobulin G; IgG; Fc; human. 
XX 

OS Homo sapiens. 
XX 

PN WO2003031464-A2. 
XX 

PD 17-APR-2003 . 
XX 

PF 09-OCT-2002; 2 002WO-US 0322 63 . 
XX 

PR 10-OCT-2001; 

PR 19-OCT-2001; 

PR 28-NOV-2001; 

PR 28-NOV-2 001; 

PR 07-JUN-2002; 

PR 25-JUN-2002; 

PR 17-JUL-2002; 

PR 16-AUG-2002; 

PR 2 8-AUG-2 0 02; 
XX 

PA (NEOS-) NEOSE TECHNOLOGIES INC. 
XX 

PI De Frees S, Zopf D, Bayer R, Bowe C, Hakes D, Chen X; 
XX 

DR WPI; 2003-449162/42. 

DR N-PSDB; ACC78891. 
XX 

PT Remodeling a peptide, by removing a saccharyl subunit from the peptide to 

PT form truncated glycan, and adding or deleting glycosyl groups to a 

PT peptide and/or adding modifying group of a peptide to remodel the 

PT peptide. 

XX 

PS Example; Fig 75B; 900pp; English. 
XX 

CC The invention relates to a cell-free, in vitro method of remodeling a 

CC peptide. The method involves removing a saccharyl subunit from the 

CC peptide, thus forming a truncated glycan, and contacting the truncated 

CC glycan with at least one glycosyltrans f erase and at least one glycosyl 

CC donor under conditions suitable to transfer at least one glycosyl donor 

CC to the truncated glycan, thus remodeling the peptide. Conjugates can be 

CC formed between a granulocyte colony stimulating factor (G-CSF) peptide, 

CC interferon alpha peptide, interferon beta peptide, Factor Vila peptide, 

CC Factor IX peptide, follicle stimulating hormone peptide, erythropoietin 

CC (EPO) peptide, granulocyte macrophage colony stimulating factor (GM-CSF) 

CC peptide, interf eron-gamma peptide, alpha-l-protease inhibitor (A-l-PI) 

CC peptide, beta-glucosidase peptide, tissue plasminogen activator (TPA) 

CC peptide, interleukin-2 (IL-2) peptide, Factor VIII peptide, TNFalpha 

CC receptor/immunoglobulin (Ig) G fusion peptide, urokinase peptide, anti- 

CC glycoprotein Ilb/lIIa monoclonal antibody peptide, chimeric anti HER2 

CC antibody peptide, anti-respiratory syncytial virus (RSV) F peptide, anti- 



2001US-0328523P. 
2001US-0344692P. 
2001US-0334233P. 
2001US-0334301P. 
2002US-0387292P. 
2002US-0391777P. 
2002US-0396594P. 
2002US-0404249P. 
2002US-0407527P. 



CC CD20 antibody peptide, recombinant DNase peptide, anti-TNF alpha peptide, 

CC insulin peptide, hepatitis B surface antigen (HbsAg) , human growth 

CC hormone (HGH) peptide, and a modifying group, where the modifying group 

CC is covalently attached to the peptide through an intact glycosyl linking 

CC group. The method is useful for a cell-free, in vitro method of 

CC remodeling the above mentioned peptides. The present sequence represents 

CC a human immunoglobulin G (IgG) Fc portion 

XX 

SQ Sequence 320 AA; 

Query Match 82.8%; Score 1361; DB 6; Length 320; 

Best Local Similarity 82.2%; Pred. No, 5.2e-91; 

Matches 2 63; Conservative 5; Mismatches 34; Indels 18; Gaps 3 

Qy 1 METDTLLLWVLLLWVPGSTGDVTM — LQMAGQ CSQNEYFDSLLHACIPCQL — 49 

I I I II I I I I I I I I I I I I I I I I I : I : I lllh I : I I 

Db 1 METDTLLLWVLLLWVPGSTGDVRRGPRSLRGRDAPAPTPCVPAECFDLLVRHCVACGLLR 60 

Qy 50 RCSSNTPPLTCQRYCNASVTNSVKGVDKTHTCPPCPAPELLGGPSVFLFPPKP 102 

II I I : I I I I I II I I I I II I I I I I 1 I I I I M I I I 

Db 61 TPRPKPAGASSPAPRTALQPQESVGAGAGEAAVDKTHTCPPCPAPELLGGPSVFLFPPKP 120 

Qy 103 KDTLMISRTPEVTCVWDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRWSVLT 162 

I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I II I I I I I I II I I I I I I I I I I I I I I I I I I I 
Db 121 KDTLMISRTPEVTCVWDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRWSVLT 180 

Qy 163 VLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTC 222 

I I I I I I I I I I I II I I I I I I I I II I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I II 
Db 181 VLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTC 24 0 

Qy 223 LVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSV 282 

I I I II I I I I I I I I II I I I II I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 241 LVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSV 300 

Qy 283 MHEALHNHYTQKSLSLSPGK 302 

I I I I I I I I I I I I I I I I I I II 
Db 301 MHEALHNHYTQKSLSLSPGK 320 



RESULT 9 
AAE35228 

ID AAE35228 standard; protein; 332 AA. 
XX 

AC AAE35228; 
XX 

DT 23-OCT-2 0 03 (revised) 

DT 28-MAY-2003 (first entry) 

XX 

DE Human TACI-Fc5 fusion protein comprising murine 26-10 VH signal peptide. 
XX 

KW Transmembrane activator; calcium modulator; nephrotropic; antibacterial; 

KW TACI; tumour necrosis factor-like protein; ZTNF2; ZTNF4; immunoglobulin; 

KW anaemia; gene therapy; cytostatic; antiinflammatory; immunosuppressive; 

KW glomerulonephritis; asthma; bronchitis; graft rejection; septic shock; 

KW dermatological; neuroprotective; cyclophilin ligand-interactor ; human; 

KW autoimmune disease; systemic lupus erythematosus; multiple sclerosis; 

KW diabetes mellitus; rheumatoid arthritis; renal disease; inflammation; 



KW mouse; Fc5 protein; chimeric. 
XX 

OS Homo sapiens . 

OS Mus sp . 

OS Chimeric. 
XX 

PN WO200294852-A2 . 
XX 

PD 28-NOV-2002. 
XX 

PF 20-MAY-2002; 2 002WO-US015910 . 
XX 

PR 24-MAY-2001; 2001US-0293343P . 
XX 

PA (ZYMO ) ZYMOGENETICS INC. 
XX 

PI Rixon MW, Gross JA; 
XX 

DR WPI; 2003-148455/14. 
XX 

PT Transmembrane activator and calcium modulator and cyclophilin ligand- 

PT interactor (TACI ) -immunoglobulin fusion protein, for treating cancer or 

PT diabetes, comprises a TACI receptor group and an immunoglobulin group. 
XX 

PS Disclosure; Col 137-138; 71pp; English. 
XX 

CC The invention relates to fusion proteins comprising transmembrane 

CC activator and calcium modulator and cyclophilin ligand-interactor (TACI) 

CC receptor group that binds tumour necrosis factor-like protein (ZTNF)2 or 

CC ZTNF4; and an immunoglobulin group comprising a constant region of an 

CC immunoglobulin. The invention is used to manufacture a medicament for 

CC inhibiting the proliferation of tumour cells in a mammalian subject. The 

CC composition comprising the fusion protein may also be used in treating 

CC autoimmune diseases (e.g. systemic lupus erythematosus, multiple 

CC sclerosis, diabetes mellitus, rheumatoid arthritis and asthma), renal 

CC diseases (e.g. glomerulonephritis), bronchitis, inflammation, graft 

CC rejection, anaemia and septic shock. The fusion proteins are also used in 

CC gene therapy. The present sequence is human TACI-Fc5 fusion protein 

CC comprising murine 26-10 VH signal peptide. This sequence is used in the 

CC invention. (Updated on 23-OCT-2003 to standardise OS field) 

XX 

SQ Sequence 332 AA; 

Query Match 77.4%; Score 1271; DB 6; Length 332; 
Best Local Similarity 72.2%; Pred. No. 1.9e-84; 

Matches 239; Conservative 15; Mismatches 37; Indels 40; Gaps 2 

Qy 9 WVLLLWVPGSTGDVTMLQMAGQCSQNEYFDSLLHACIPCQLRCSSNTP 56 

I : I : I : I : I I : :l : I I I I : I : I : : 

Db 5 WIFLFLLSGTAG VLSAMRSCPEEQYWDPLLGTCMSCKTICNHQSQRTCAAFCRSLSC 61 

Qy 57 PLTCQRYCNASVTNSVKGVDKTHTCPPCPAPELLG 91 

I ! : I : : I I I I I II I I I I I I I I 

Db 62 RKEQGKFYDHLLRDCISCASICGQHPKQCAYFCENKLRSEPKSSDKTHTCPPCPAPEAEG 121 

Qy 92 GPSVFLFPPKPKDTLMISRTPEVTCWVDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQY 151 
I I I I I I I I I I I I I I II I I I I I I I II I I I I I I I M I I I I I I I I M I I I I I I I I I I I I M I 



Db 



122 APSVFLFPPKPKDTLMISRTPEVTCVWDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQY 181 



Qy 



152 NSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRD 211 





Db 



182 NSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPSSIEKTISKAKGQPREPQVYTLPPSRD 241 



Qy 



212 ELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSR 271 




Db 



2 42 ELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSR 301 



Qy 



272 WQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 302 




Db 



302 WQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 332 



RESULT 10 
AAE13055 

ID AAE13055 standard; protein; 446 AA. 
XX 

AC AAE13055; 
XX 

DT 28-JAN-2002 (first entry) 
XX 

DE ADAM-17dis-Fc fusion construct. 
XX 

KW Human; ADAM disintegrin domain; integrin; endothelial cell migration; 

KW angiogenesis ; ocular disorder; inflammatory disease; bone resorption; 

KW osteoporosis; restenosis; thrombosis; tissue repair; wound healing; 

KW retinopathy; retinoblastoma; neovascular glaucoma; macular degeneration; 

KW retrolental fibroplasia; inflammatory bowel disease; rubeosis; uveitis; 

KW arthritis; rheumatism; myocardial infarction; coronary artery disease; 

KW tumour; stroke; atherosclerosis; arteriosclerosis; injury; psoriasis; 

KW preeclampsia; embolism; ischaemia; fusion construct; immunoglobulin K; 

KW IgK; Fc region; antiinflammatory; osteopathic; vasotropic; thrombolytic. 
XX 

OS Homo sapiens. 

OS Unidentified. 

OS Chimeric . 
XX 

FH Key Location/Qualifiers 

FT Peptide 1. .20 

FT /label= IgK_leader_sequence 

FT Protein 21. .446 

FT /note= "Mature ADAM-17dis-Fc fusion construct" 

FT Region 23. .216 

FT /note= "Human ADAM disintegrin" 

FT Domain 34. .93 

FT /label= Human_ADAM__disintegrin_domain 

FT Region 219. .44 6 

FT /note= "Fc region" 

XX 

PN WO200162905-A2 . 
XX 

PD 30-AUG-2001. 
XX 

PF 23-FEB-2001; 2 00 1WO-US0057 01 . 
XX 



PR 25-FEB-2000; 2000US-0184865P . 
XX 

PA (IMMV ) IMMUNEX CORP. 
XX 

PI Fanslow WC, Cerretti DP, Poindexter KM, Black RA; 
XX 

DR WPI; 2001-625725/72. 

DR N-PSDB; AAD21435. 
XX 

PT Antagonizing the binding of an integrin to its ligand useful for the 

PT treatment of angiogenesis comprises administration of an ADAM-disintegrin 

PT domain polypeptide. 

XX 

PS Claim 11; Page 46-47; 66pp; English. 
XX 

CC The invention relates to the method and use of ADAM disintegrin domain 

CC polypeptides for inhibiting the biological activity of integrins, 

CC endothelial cell migration and angiogenesis. ADAM disintegrin domain 

CC polypeptides are used for treatment of ocular disorders, malignant and 

CC metastatic conditions, inflammatory diseases, osteoporosis and other 

CC conditions mediated by accelerated bone resorption, restenosis, 

CC inappropriate platelet activation, recruitment or aggregation, thrombosis 

CC or a condition requiring aggregation, thrombosis or a condition requiring 

CC tissue repair or wound healing, angiogenesis, ocular neovascularisation 

CC or solid tumour, diabetic retinopathy, retinopathy or prematurity, 

CC neovascular glaucoma, retinoblastoma, retrolental fibroplasias, rubeosis, 

CC uveitis, macular degeneration and corneal graft neovascularisation, 

CC inflammatory diseases, ocular tumours, diseases associated with choroidal 

CC or iris neovascularisation, arthritis, rheumatism, inflammatory bowel 

CC disease, psoriasis, coronary artery disease or injury, myocardial 

CC infarction or injury following myocardial infarction, stroke, unstable 

CC angina, atherosclerosis, arteriosclerosis, preeclampsia, embolism, 

CC platelet-associated ischaemic disorders including lung ischaemia, 

CC coronary ischaemia, cerebral ischaemia, restenosis following percutaneous 

CC coronary intervention including angioplasty, atherectomy, stent placement 

CC and bypass surgery, thrombotic disorders including coronary artery 

CC thrombosis, cerebral artery thrombosis, intracardiac thrombosis, 

CC peripheral artery thrombosis, venous thrombosis, thrombosis and 

CC coagulopathies associated with exposure to a foreign or injured tissue 

CC surface and reocclusion following thrombosis, deep venous thrombosis, 

CC pulmonary embolism, transient ischaemic attacks and another conditions 

CC where vascular occlusion is a common underlying feature, in individuals 

CC at high risk for thrombus formation of reformation, advanced coronary 

CC artery disease, or for occlusion, reocclusion, stenosis and/or restenosis 

CC of blood vessels or stroke benign tumours and preneoplastic conditions, 

CC myocardial angiogenesis, haemophilic joints, scleroderma, vascular 

CC adhesions, asthma and allergy, eczema and dermatitis, graft versus host 

CC disease, sepsis, adult respiratory distress syndrome, telangiectasia and 

CC wound granulation. The method are used in combination with angioplasty 

CC procedures, such as balloon angioplasty, laser angioplasty, coronary 

CC atherectomy or similar techniques, carotid endarterectomy, anastomosis of 

CC vascular grafts, surgery having a high risk of thrombus formation (i.e. 

CC coronary bypass surgery, insertion of a prosthetic valve or vessel and 

CC the like) , atherectomy, stent placement, placement of a chronic 

CC cardiovascular device such as an in-dwelling catheter or prosthetic valve 

CC or vessel, organ transplantation or bypass surgery. The present sequence 

CC is ADAM disintegrin domain polypeptide fusion construct. The fusion 



CC construct comprises of immunoglobulin K (IgK) leader, human ADAM 

CC disintegrin and Fc region 

XX 

SQ Sequence 446 AA; 

Query Match 77.2%; Score 1269; DB 4; Length 446; 

Best Local Similarity 57.2%; Pred. No. 3.9e-84; 

Matches 261; Conservative 2; Mismatches 29; Indels 164; Gaps ' 

Qy 1 METDTLLLWVLLLWVPGSTG DVTMLQMAGQC 31 

I I I I I I I I I I I I I I I I I I II III: II 

Db 1 METDTLLLWVLLLWVPGSTGTSCGNSRVDEGEECDPGIMYLNNDTCCNSDCT-LKEGVQC 59 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



32 SQNEYFDSLLHACIPCQLRCSSNTPPLTCQRYCNAS 67 

I I I I I MM: 

60 SDRN SPCCKNCQFETAQKKCQEAINATCKGVSYCTGNSSECPPPGNAEDDT 110 

68 VTNSVK 73 

I I I 

111 VCLDLGKCKDGKCIPFCEREQQLESCACNETDNSCKVCCRDLSGRCVPYVDAEQKNLFLR 17 0 

74 GVDKTHTCPPCPA 86 

II I II I II I I I 

171 KGKPCTVGFCDMNGKCEKRVQDVIERFWDFIDQLSINTFGKFLADNRSCDKTHTCPPCPA 230 

87 PELLGGPSVFLFPPKPKDTLMISRTPEVTCVWDVSHEDPEVKFNWYVDGVEVHNAKTKP 146 

II I II II I M II II I II II I M I M II II II I I II I I I II I M I I I I I I II I II I I I 

231 PEAEGAP SVFLFP PKPKDTLMI S RT PE VTCVVVDVSHEDPEVKFNWYVDGVEVHNAKTKP 290 



147 



206 



REEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTL 

I M II II I I II I I I II I I II I II II I II II I I II I II II I II I II II I I M I I II II I II 
291 REEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTL 350 



207 



266 



PPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLT 

II II I I II I II II II II II II II II II I I I II II II II I I I I II II I I II II I I I II I I I 
351 PPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLT 410 



267 



302 



VDKSRWQQGNVFSCS VMHEALHNHYTQKSLSLSPGK 

II I II II I II I II I I I II II II II I I M I I II II I I 

411 VDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 446 



RESULT 11 
AAY84965 

ID AAY84965 standard; protein; 633 AA. 
XX 

AC AAY84965; 
XX 

DT 21-AUG-2000 (first entry) 
XX 

DE Amino acid sequence of a CD-20 specific chimeric receptor. 
XX 

KW CD20-specif ic receptor; CD-20 speciifc redirected T cell; leukemia; 

KW CD20+ malignancy; non-Hodgkin ' s lymphoma; myeoablative chemotherapy; 

KW stem cell rescue; autoimmune disease; lupus; rheumatoid arthritis. 
XX 

OS Synthetic. 



OS 
XX 
FH 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 



Mus sp. 



Peptide 



Key 

Region 



Location/ Qualifiers 
21. .126 

/note= "anti-CD20 variable regions" 
27. .86 



Region 



Region 



Region 



Peptide 



/note= "signal peptide from murine T86.66 antibody kappa 
light chain" 
145. .266 

/note= "GS18 linker" 
283. .392 

/note= "hinge region" 
393. .499 

/note= "CH3 region" 
500. .521 



/note= "CD4 transmembrane region" 



Region 



522. .633 

/ note- " zeta chain" 



XX 

PN WO200023573-A2. 
XX 

PD 27-APR-2000. 
XX 

PF 20-OCT-1999; 99WO-US02 44 8 4 . 
XX 

PR 20-OCT-1998; 98US-0105014P . 
XX 

PA (CITY ) CITY OF HOPE. 
XX 

PI Raubitschek A, Jensen MC, Wu AM; 
XX 

DR WPI; 2000-339676/29. 

DR N-PSDB; AAA15019. 
XX 

PT Genetically engineered CD2 0-specif ic redirected T cells useful for 

PT treating a CD20 + malignancy,, such as non-Hodgkin ' s lymphoma or CD20+ 

PT acute or chronic leukemia, and autoimmune disease. 
XX 

PS Claim 10; Page 53-55; 58pp; English. 
XX 

CC The present sequence represents a synthetic CD20-specif ic chimeric 

CC receptor. The specification describes CD-20 specific redirected T cells 

CC which express and bear on the cell surface membrane a CD2 0-chimeric 

CC receptor comprising an intracellular signalling domain, a transmembrane 

CC domain and an extracellular domain, the extracellular domain comprising a 

CC CD20-specif ic receptor. The genetically engineered CD2 0-specif ic 

CC redirected T cells are useful for treating a CD20 + malignancy, such as 

CC non-Hodgkin ' s lymphoma or CD20+ acute or chronic leukemia, in a human 

CC patient having previously undergone myeoablative chemotherapy and stem 

CC cell rescue. The genetically engineered CD20-specif ic redirected T cells 

CC are also useful for abrogating an untoward B cell function, such as 

CC autoimmune disease (lupus or rheumatoid arthritis) in a patient 

XX 

SQ Sequence 633 AA; 



Query Match 77.2%; Score 1268.5; DB 3; Length 633; 

Best Local Similarity 53.1%; Pred. No. 6.4e-84; 



Matches 2 65; Conservative 



9; Mismatches 28; Indels 197; Gaps 5 



Qy 1 METDTLLLWVLLLWVPGSTGDVTMLQMAGQCSQNE 35 

I I I I I I I I I I I I I I I I I I I I I : : I I : 

Db 1 METDTLLLWVLLLWVPGSTGDIVLTQSPAILSASPGEKVTMTCRASSSVNYMDWYQKKPG 60 

Qy 36 YFDSLLHACIP CQ 48 

I I I : : I II 
Db 61 SSPKPWIYATSNLAS GVPARFSGSGSGTSYSLTISRVEAEDAATYYCQQWSFNPPTFGGG 12 0 

Qy 49 48 

Db 121 TKLEIKGSTSGGGSGGGSGGGGSSEVQLQQSGAELVKPGASVKMSCKASGYTFTSYNMHW 18 0 

Qy 4 9 LRCSSNTPPLTCQRYC 64 

: : I I I : II 

Db 181 VKQTPGQGLEWIGAIYPGNGDTSYNQKFKGKATLTADKSSSTAYMQLSSLTSEDSADYYC 240 

Qy 65 NASVTNSV KGVDKTHTCPPCPAPELLGGPSVFLFPPKPK 103 

I 1:1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 241 ARSNYYGSSYWFFDWGAGTTVTVSSLDPKSSDKTHTCPPCPAPELLGGPSVFLFPPKPK 300 

Qy 104 DTLMISRTPEVTCVWDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRWSVLTV 163 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I II I I I I I I I I 
Db 301 DTLMISRTPEVTCVWDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRWSVLTV 360 

Qy 164 LHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCL 223 

I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I i I I I I M I I I I I I I I I I I I I I M I I I I I 
Db 361 LHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCL 42 0 

Qy 224 VKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVM 283 

I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 421 VKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVM 4 80 

Qy 284 HEALHNHYTQKSLSLSPGK 302 

I I I II M I I I I I I I I I I I I 

Db 481 HEALHNHYTQKSLSLSPGK 499 



RESULT 12 
AAE13053 

ID AAE13053 standard; protein; 465 AA. 
XX 

AC AAE13053; 
XX 

DT 28-JAN-2002 (first entry) 
XX 

DE ADAM-lOdis-Fc fusion construct. 
XX 

KW Human; ADAM disintegrin domain; integrin; endothelial cell migration; 

KW angiogenesis ; ocular disorder; inflammatory disease; bone resorption; 

KW osteoporosis; restenosis; thrombosis; tissue repair; wound healing; 

KW retinopathy; retinoblastoma; neovascular glaucoma; macular degeneration; 

KW retrolental fibroplasia; inflammatory bowel disease; rubeosis; uveitis; 

KW arthritis; rheumatism; myocardial infarction; coronary artery disease; 

KW tumour; stroke; atherosclerosis; arteriosclerosis; injury; psoriasis; 

KW preeclampsia; embolism; ischaemia; fusion construct; immunoglobulin K; 



KW IgK; Fc region; antiinflammatory; osteopathic; vasotropic; thrombolytic. 
XX 

OS Homo sapiens. 

OS Unidentified. 

OS Chimeric. 
XX 

FH Key Location/Qualifiers 

FT Peptide 1. .20 

FT /label= IgK_leader_sequence 

FT Protein 21. .465 

FT /note= "Mature ADAM-lOdis-Fc fusion construct" 

FT Region 23. .235 

FT /note= "Human ADAM disintegrin" 

FT Domain 34. .99 

FT /label= Human_ADAM_disintegrin_domain 

FT Region 238. .465 

FT /note= "Fc region" 

XX 

PN WO200162905-A2 . 
XX 

PD 30-AUG-2001. 
XX 

PF 23-FEB-2001; 2001WO-US005701 . 
XX 

PR 25-FEB-2000; 2000US-0184865P . 
XX 

PA (IMMV ) IMMUNEX CORP. 
XX 

PI Fanslow WC, Cerretti DP, Poindexter KM, Black RA; 
XX 

DR WPI; 2001-625725/72. 

DR N-PSDB; AAD21433. 
XX 

PT Antagonizing the binding of an integrin to its ligand useful for the 

PT treatment of angiogenesis comprises administration of an ADAM-disintegrin 

PT domain polypeptide. 

XX 

PS Claim 11; Page 40-41; 66pp; English. 
XX 

CC The invention relates to the method and use of ADAM disintegrin domain 

CC polypeptides for inhibiting the biological activity of integrins, 

CC endothelial cell migration and angiogenesis. ADAM disintegrin domain 

CC polypeptides are used for treatment of ocular disorders, malignant and 

CC metastatic conditions, inflammatory diseases, osteoporosis and other 

CC conditions mediated by accelerated bone resorption, restenosis, 

CC inappropriate platelet activation, recruitment or aggregation, thrombosis 

CC or a condition requiring aggregation, thrombosis or a condition requiring 

CC tissue repair or wound healing, angiogenesis, ocular neovascularisation 

CC or solid tumour, diabetic retinopathy, retinopathy or prematurity, 

CC neovascular glaucoma, retinoblastoma, retrolental fibroplasias, rubeosis, 

CC uveitis, macular degeneration and corneal graft neovascularisation, 

CC inflammatory diseases, ocular tumours, diseases associated with choroidal 

CC or iris neovascularisation, arthritis, rheumatism, inflammatory bowel 

CC disease, psoriasis, coronary artery disease or injury, myocardial 

CC infarction or injury following myocardial infarction, stroke, unstable 

CC angina, atherosclerosis, arteriosclerosis, preeclampsia, embolism, 

CC platelet-associated ischaemic disorders including lung ischaemia, 



CC coronary ischaemia, cerebral ischaemia, restenosis following percutaneous 

CC coronary intervention including angioplasty, atherectomy, stent placement 

CC and bypass surgery, thrombotic disorders including coronary artery 

CC thrombosis, cerebral artery thrombosis, intracardiac thrombosis, 

CC peripheral artery thrombosis, venous thrombosis, thrombosis and 

CC coagulopathies associated with exposure to a foreign or injured tissue 

CC surface and reocclusion following thrombosis, deep venous thrombosis, 

CC pulmonary embolism, transient ischaemic attacks and another conditions 

CC where vascular occlusion is a common underlying feature, in individuals 

CC at high risk for thrombus formation of reformation, advanced coronary 

CC artery disease, or for occlusion, reocclusion, stenosis and/or restenosis 

CC of blood vessels or stroke benign tumours and preneoplastic conditions, 

CC myocardial angiogenesis , haemophilic joints, scleroderma, vascular 

CC adhesions, asthma and allergy, eczema and dermatitis, graft versus host 

CC disease, sepsis, adult respiratory distress syndrome, telangiectasia and 

CC wound granulation. The method are used in combination with angioplasty 

CC procedures, such as balloon angioplasty, laser angioplasty, coronary 

CC atherectomy or similar techniques, carotid endarterectomy, anastomosis of 

CC vascular grafts, surgery having a high risk of thrombus formation (i.e. 

CC coronary bypass surgery, insertion of a prosthetic valve or vessel and 

CC the like) , atherectomy, stent placement, placement of a chronic 

CC cardiovascular device such as an in-dwelling catheter or prosthetic valve 

CC or vessel, organ transplantation or bypass surgery. The present sequence 

CC is ADAM disintegrin domain polypeptide fusion construct. The fusion 

CC construct comprises of immunoglobulin K (IgK) leader, human ADAM 

CC disintegrin and Fc region 

XX 

SQ Sequence 465 AA; 

Query Match 77.0%; Score 1265.5; DB 4; Length 465; 

Best Local Similarity 56.1%; Pred. No. 7.3e-84; 

Matches 261; Conservative 11; Mismatches 30; Indels 163; Gaps 7 

Qy 1 METDTLLLWVLLLWVPGSTG DVTM L 2 5 

I I I II I I I I I I I I I I I I I I I I I 

Db 1 METDTLLLWVLLLWVPGSTGTSCGNGMVEQGEECDCGYSDQCKDECCFDANQPEGRKCKL 60 

Qy 26 QMAGQCSQNE YFDSLLHACIP 46 

: I I I : : I : I I I 

Db 61 KPGKQCSPSQGPCCTAQCAFKSKSEKCRDDSDCAREGICNGFTALCPASDPKPNFTDCNR 12 0 

Qy 47 CQLRCSSNTPPLTC 60 

I : I I I I 

Db 121 HTQVCINGQCAGSICEKYGLEECTCASSDGKDDKELCHVCCMKKMDPSTCASTGSVQWSR 180 

Qy 61 QRYCN ASVTNSV KGVDK 77 

: I I : I : : : : I I 

Db 181 HFSGRTITLQPGSPCNDFRGYCDVFMRCRLVDADGPLARLKKAIFSPELYENIAERSCDK 240 

Qy 7 8 THTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVWDVSHEDPEVKFNWYVDGV 137 

II I I I I I I I I I I I I I I I II I I I I I I II I I I I I I I I I I I I I II I I II I I I I I I I I I I I 

Db 241 THTCPPCPAPE7VEGAPSVFLFPPKPKDTLMISRTPEVTCWVDVSHEDPEVKFNWYVDGV 300 

Qy 138 EVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQ 197 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 301 EVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQ 360 



Qy 198 PREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDG 257 

I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I 
Db 361 PREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDG 420 



Qy 


258 SFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 302 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 II 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 


Db 


421 SFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 465 


RESULT 13 


AA014136 


ID 


AA014136 standard; protein; 404 AA. 


XX 




AC 


AA014136; 


XX 




DT 


2 9-AUG-2 003 (revised) 


DT 


02-MAY-2002 (first entry) 


XX 




DE 


Protein of a complete TACI extracellular domain fused to a human IgG-Fc. 


XX 




KW 


Human transmembrane activator CAML interactor protein; TACI; cytostatic; 


KW 


cell proliferation; tumour; vulnerary; renal cell cancer; mastocytoma; 


KW 


Kaposi's sarcoma; breast; ovarian carcinoma; rectal; throat; melanoma; 


KW 


colon; bladder; mammary adenocarcinoma; gastrointestinal; hyperplasia; 


KW 


pharyngeal squamous cell; stomach; cellular hyperprolif eration; pannus ; 


KW 


scleroderma; rheumatoid arthritis; scarring; mean survival time; liver; 


KW 


lung fibrosis; uterine; IgG-Fc; chimeric. 


XX 




OS 


Homo sapiens . 


OS 


Chimeric . 


XX 




PN 


WO200181417-A2 . 


XX 




PD 


01-NOV-2 001. 


XX 




PF 


27-APR-2001; 2001WO-US040626 . 


XX 




PR 


27-APR-2 000; 2 0 OOUS-O 19994 6P . 


XX 




PA 


(BIOJ ) BIOGEN INC. 


PA 


(APOT-) APOTECH R&D SA. 


XX 




PI 


Ambrose C, Thompson J, Schneider P, Rennert P; 


XX 




DR 


WPI; 2002-062027/08. 


DR 


N-PSDB; AAK98732. 


XX 




PT 


Treating mammal for condition associated with undesired cell 


PT 


proliferation e.g., solid tumor or reducing solid tumor size located in 


PT 


mammal comprises administering transmembrane activator CAML interactor 


PT 


protein reagent. 


XX 




PS 


Example; Fig 5; 42pp; English. 


XX 




cc 


This sequence represents the protein of a complete extracellular domain 


cc 


of TACI fused to a human IgG-Fc. The invention relates to treating a 


cc 


mammal for a condition associated with undesired cell proliferation (e.g 



CC a solid tumour, or reducing the size of a solid tumour located on or in a 

CC mammal) comprising administering a transmembrane activator CAML 

CC interactor protein (TACI) reagent. The TACI reagent has cytostatic and 

CC vulnerary activity. Treating a mammal (e.g. human, cow, horse, dog, 

CC mouse, rat or cat) for a condition associated with undesired cell 

CC proliferation (e.g. cancer such as renal cell cancer, Kaposi f s sarcoma, 

CC breast cancer, sarcoma, ovarian carcinoma, rectal cancer, throat cancer, 

CC melanoma, colon cancer, bladder cancer, mastocytoma, lung cancer, mammary 

CC adenocarcinoma, pharyngeal squamous cell carcinoma, gastrointestinal 

CC cancer or stomach cancer) . The method is also useful for treating 

CC cellular hyperprolif eration (hyperplasia) such as scleroderma, pannus 

CC formation in rheumatoid arthritis, post-surgical scarring and lung, liver 

CC and uterine fibrosis. The TACI reagent of the invention can extend mean 

CC survival time of a mammal by 25% as compared to the mean survival time of 

CC a mammal in the absence of administering the TACI reagent. The TACI 

CC reagent also reduces the size of the tumour by 25% or more. (Updated on 

CC 29-AUG-2003 to standardise OS field) 

XX 

SQ Sequence 404 AA; 

Query Match 77.0%; Score 1264.5; DB 5; Length 404; 

Best Local Similarity 77.7%; Pred. No. 7.3e-84; 

Matches 244; Conservative 6; Mismatches 19; Indels 45; Gaps 3; 

Qy 33 QNEYFDSLLHACIPC QLRCSSNTPPLTCQRYCNASVTN 7 0 

I ::: I I I I I I : I I III : I : I I 

Db 92 QGKFYDHLLRDCISCASICGQHPKQCAYFCENKLRSPVNLPP-ELRRQRSGEVENNSDNS 150 

Qy 71 SVKGVDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMI 108 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 151 GRYQGLEHRGSEAS PALPGLKLSADQVDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMI 210 

Qy 109 SRTPEVTCVWDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDW 168 

I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I 
Db 211 SRTPEVTCVWDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDW 27 0 

Qy 169 LNGKE YKCKVSNKALPAP I EKT I S KAKGQPREPQVYTLP P S RDELTKNQVSLTCLVKGFY 228 

I I I I I I I I II I I I I I I II I I I I I I II I I II I I I I I I I I I I I I I I I I I I I M I I I I I I I I I 

Db 271 LNGKEYKCKVSNKAL PAP I EKT IS KAKGQPREPQVYTLP PS RDELTKNQVSLTCLVKGFY 330 

Qy 229 PSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALH 288 

I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I II I I I I I I II I I I I I I M II 

Db 331 PSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALH 390 

Qy 289 NHYTQKSLSLSPGK 302 

II II II I I I I I I I I 

Db 391 NHYTQKSLSLSPGK 404 



RESULT 14 
AAE35225 

ID AAE35225 standard; protein; 348 AA. 
XX 

AC AAE35225; 
XX 

DT 28-MAY-2003 (first entry) 
XX 



DE Human TACI(dl-29, dlll-154 ) -Fc5 variant construct protein. 
XX 

KW Transmembrane activator; calcium modulator; nephrotropic; antibacterial; 

KW TACI; tumour necrosis factor-like protein; ZTNF2; ZTNF4; immunoglobulin; 

KW anaemia; gene therapy; cytostatic; antiinflammatory; immunosuppressive; 

KW glomerulonephritis; asthma; bronchitis; graft rejection; septic shock; 

KW dermatological ; neuroprotective; cyclophilin ligand-interactor; human; 

KW autoimmune disease; systemic lupus erythematosus; multiple sclerosis; 

KW diabetes mellitus; rheumatoid arthritis; renal disease; inflammation; 

KW chimeric. 
XX 

OS Homo sapiens . 
XX 

PN WO200294852-A2 . 
XX 

PD 28-NOV-2002. 
XX 

PF 20-MAY-2002; 2002WO-US015910 . 
XX 

PR 24-MAY-2001; 2001US-0293343P . 
XX 

PA (ZYMO ) ZYMOGENETICS INC. 
XX 

PI Rixon MW, Gross JA; 
XX 

DR WPI; 2003-148455/14. 

DR N-PSDB; AAD53772. 
XX 

PT Transmembrane activator and calcium modulator and cyclophilin ligand- 

PT interactor (TACI ) -immunoglobulin fusion protein, for treating cancer or 

PT diabetes, comprises a TACI receptor group and an immunoglobulin group. 
XX 

PS Claim 26; Col 129-130; 71pp; English. 
XX 

CC The invention relates to fusion proteins comprising transmembrane 

CC activator and calcium modulator and cyclophilin ligand-interactor (TACI) 

CC receptor group that binds tumour necrosis factor-like protein (ZTNF)2 or 

CC ZTNF4; and an immunoglobulin group comprising a constant region of an 

CC immunoglobulin. The invention is used to manufacture a medicament for 

CC inhibiting the proliferation of tumour cells in a mammalian subject. The 

CC composition comprising the fusion protein may also be used in treating 

CC autoimmune diseases (e.g. systemic lupus erythematosus, multiple 

CC sclerosis, diabetes mellitus, rheumatoid arthritis and asthma), renal 

CC diseases (e.g. glomerulonephritis), bronchitis, inflammation, graft 

CC rejection, anaemia and septic shock. The fusion proteins are also used in 

CC gene therapy. The present sequence is human TACI-Fc5 variant construct 

CC protein 

XX 

SQ Sequence 348 AA; 

Query Match 76.9%; Score 1263; DB 6; Length 348; 
Best Local Similarity 86.7%; Pred. No. 7.9e-84; 

Matches 234; Conservative 8; Mismatches 26; Indels 2; Gaps 1 

Qy 33 QNEYFDSLLHACIPCQLRCSSNTPPLTCQRYCNASVTNSVKGVDKTHTCPPCPAPELLGG 92 

I : : : I I I III I : I I : I : : I I I I I I I I I I I I I I I 

Db 81 QGKFYDHLLRDCISCASICGQH — PKQCAYFCENKLRSEPKSSDKTHTCPPCPAPEAEGA 138 



Qy 93 PSVFLFPPKPKDTLMISRTPEVTCVWDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYN 152 

I I I I I I I I I I I I I I I I I I I I I I I I i 1 II I I I I I I I I I I I I I II I I I I I II I I I I I I | I I I 
Db 139 PSVFLFPPKPKDTLMISRTPEVTCVWDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYN 198 

Qy 153 STYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDE 212 

I I I II I II I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 199 STYRWSVLTVLHQDWLNGKEYKCKVSNKALPSSIEKTISKAKGQPREPQVYTLPPSRDE 258 

Qy 213 LTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRW 272 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 259 LTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRW 318 

Qy 273 QQGNVFSCSVMHEALHNHYTQKSLSLSPGK 302 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 319 QQGNVFSCSVMHEALHNHYTQKSLSLSPGK 348 



RESULT 15 
ABJ37102 

ID ABJ37102 standard; protein; 608 AA. 
XX 

AC ABJ37102; 
XX 

DT 08-MAY-2003 (first entry) 
XX 

DE Concatameric immunoadhesion human protein sequence SEQ ID No 10. 
XX 

KW Antiinflammatory; antibacterial; immunosuppressive; antirheumatic; 

KW antiarthritic; immunomodulator ; concatameric protein; soluble domain; 

KW dimeric protein; inflammation; septicaemia; cytotoxicity; 

KW rheumatoid arthritis; cachexia; inflammation; human. 
XX 

OS Homo sapiens . 
XX 

PN WO2003010202-A1. 
XX 

PD 06-FEB-2003. 
XX 

PF 26-JUL-2002; 2 0 02WO-KR00 1427 . 
XX 

PR 26-JUL-2001; 2001KR-00045028 . 
XX 

PA (MEDE-) MEDEXGEN CO LTD. 
XX 

PI Chung Y, Han J, Lee H, Choi E, Kim J; 
XX 

DR WPI; 2003-229639/22. 

DR N-PSDB; ABT32045. 
XX 

PT New concatameric protein having two soluble domains, useful for 

PT diagnosing and treating disorders associated with the dimeric protein or 

PT its glycosylated form, such as inflammation, septicemia, rheumatoid 

PT arthritis and cachexia. 

XX 

PS Claim 27; Page 139-142; 211pp; English. 
XX 



CC The invention relates to a novel concatameric protein comprising two 

CC soluble domains, in which an N-terminus of a soluble domain of a 

CC biologically active protein is linked to a C-terminus of an identical 

CC soluble domain or a different soluble domain of a biologically active 

CC protein. The methods and compositions of the present invention are useful 

CC for the diagnosis and treatment of disorders associated with dimeric 

CC protein or its glycosylated form, such as inflammation, septicaemia, 

CC cytotoxicity, rheumatoid arthritis, cachexia and other inf lammation- 

CC related diseases. This sequence represents the human concatameric protein 

CC of the invention 

XX 

SQ Sequence 608 AA; 

Query Match 76.7%; Score 1260; DB 6; Length 608; 

Best Local Similarity 76.5%; Pred. No. 2.5e-83; 

Matches 244; Conservative 6; Mismatches 21; Indels 48; Gaps 4 

Qy 31 CSQNE YFDSLLHACIPCQLRCSSNTPPLTCQRYCNASVT 69 

I : I : I : I I I I I : I hit I I 

Db 291 CRKNQYRHYWSENLFQCFNCSL-CLNGTVHLSCQEKQNTVCTCHAGFFLRENECVSCSNC 349 

Qy 7 0 NSVKGV DKTHTCPPCPAPELLGGPSVFLFPPKPK 103 

: I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 350 KKSLECTKLCLPQIENVKGTEDSGTTAEPKSCDKTHTCPPCPAPELLGGPSVFLFPPKPK 409 

Qy 104 DTLMISRTPEVTCVWDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRWSVLTV 163 

II I I I I I I I I I I I I I I 1 I I M I I I I I I I I I I I I I I I I I I I I I I II I I I II II I II I I I I I 

Db 410 DTLMISRTPEVTCVWDVSHEDPEVKFNWYVDGVEWNAKTKPREEQYNSTYRWSVLTV 4 69 

Qy 164 LHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCL 223 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I 11 I I I I II I I 
Db 470 LHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCL 529 

Qy 22 4 VKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVM 2 83 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I 
Db 530 VKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVM 589 

Qy 284 HEALHNHYTQKSLSLSPGK 302 

I I I I I I I I I I I I I I I I I I I 
Db 590 HEALHNHYTQKSLSLSPGK 608 



Search completed: April 19, 2004, 13:20:59 
Job time : 64 . 7613 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on: 



Title: 

Perfect score: 
Sequence : 



April 19, 2004, 13:19:08 ; Search time 20.5062 Seconds 

(without alignments) 
760.310 Million cell updates/sec 

PCT-US03-05147-3 
1643 

1 METDTLLLWVLLLWVPGSTG MHEALHNHYTQKSLSLSPGK 302 



Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 389414 seqs, 51625971 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



389414 



Database 



Issued_Patents_AA: * 

1: /cgn2_6/ptodata/2/iaa/5A_COMB.pep: * 

2 : /cgn2_6/ptodata/2/iaa/5B_COMB.pep: * 

3 : /cgn2_6/ptodata/2/iaa/6A_COMB.pep: * 

4 : /cgn2_6/ptodata/2/iaa/6B_COMB.pep: * 

5 : /cgn2_6/ptodata/2/iaa/PCTUS_C0MB.pep : * 

6 : /cgn2_6/ptodata/2/iaa/backf ilesl . pep : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
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Sequence 


9, Appli 


45 


1237 


75 


3 


451 


2 


US-08-887-352B-14 


Sequence 
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ALIGNMENTS 



RESULT 1 

US-09-612-033B-14 

; Sequence 14, Application US/09612033B 

; Patent No. 6627199 

; GENERAL INFORMATION: 

; APPLICANT: Saris, Chris 

; TITLE OF INVENTION: Isolation, Identification, and Characterization of 
; TITLE OF INVENTION : tmst2, a No. 6627199el Member of the TNF-Receptor 
Super family 

; TITLE OF INVENTION: of Genes 
; FILE REFERENCE: 01017/35434A 

; CURRENT APPLICATION NUMBER: US/09/ 612 , 033B 

; CURRENT FILING DATE: 2000-07-07 

; PRIOR APPLICATION NUMBER: US 60/143,063 

; PRIOR FILING DATE: 1999-07-09 

; NUMBER OF SEQ ID NOS : 15 

; SOFTWARE: Patentln Ver. 2.0 

; SEQ ID NO 14 



; LENGTH: 39 8 

TYPE: PRT 
; ORGANISM: Artificial Sequence 

FEATURE : 

; OTHER INFORMATION: Description of Artificial Sequence: Fusion protein 

OTHER INFORMATION: consisting of Mus musculus sequences and 
; OTHER INFORMATION: Immunoglobulin sequences 
US-09-612-033B-14 

Query Match 77.0%; Score 1264.5; DB 4; Length 398; 

Best Local Similarity 83.4%; Pred. No. 1.5e-112; 

Matches 242; Conservative 6; Mismatches 27; Indels 15; Gaps 5 

Qy 26 QMAGQCSQNE YFD-SLLHACIPCQLRCSSNTPPL-TCQRYCNASVTNSV 72 

: I II I : I : | | | : | | | | | : : | | 

Db 111 EMVADCSATSDRKCQCRTGLYYYDPKFPESCRPC-TKCPQGIPVLQECNSTANTVCSSSV 169 

Qy 73 KGVDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVWDVSHEDPEVKFNW 132 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 170 SNVD-THTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVWDVSHEDPEVKFNW 228 

Qy 133 YVDGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTIS 192 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
Db 229 YVDGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPI EKTI S 288 

Qy 193 KAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPV 252 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I 
Db 289 KAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPV 348 

Qy 253 LDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 302 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 34 9 LDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 39 8 



RESULT 2 

US-09-180-100-11 

; Sequence 11, Application US/09180100 
; Patent No. 6306395 
; GENERAL INFORMATION: 

; APPLICANT: NAKAMURA, No. 6306395io 
; APPLICANT: NAGATA, Shigekazu 

; TITLE OF INVENTION: NOVEL Fas ANTIGEN DERIVATIVE 
; FILE REFERENCE: 1110-207P 

; CURRENT APPLICATION NUMBER: US/09/180 , 100 
; CURRENT FILING DATE: 1998-11-02 

EARLIER APPLICATION NUMBER: PCT/ JP97/ 015 02 

EARLIER FILING DATE: 1997-05-01 

NUMBER OF SEQ ID NOS : 25 
; SOFTWARE: Patentln Ver. 2.0 
; SEQ ID NO 11 
; LENGTH: 360 

TYPE: PRT 
; ORGANISM: Homo sapiens 
US-09-180-100-11 



Query Match 76.5%; Score 1257.5; DB 4; Length 360; 

Best Local Similarity 84.8%; Pred. No. 6e-112; 



Matches 24 0; Conservative 



8; Mismatches 20; Indels 15; Gaps 4 



Qy 



30 QCSQNEYFDS-LLHACIPC QLRCSSNTPPLTCQRYCNASVTNSVKGVDKTH 7 9 



Db 



8 3 RCKPNFFCNSTVCEHCDPCTKCEHGIIKECTLTSNT KCKE — EGSRSNEPKSCDKTH 137 



Qy 



8 0 TCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVWDVSHEDPEVKFNWYVDGVEV 139 




Db 



13 8 TCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCWVDVSHEDPEVKFNWYVDGVEV 197 



Qy 



140 HNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPR 199 




Db 



198 HNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPR 257 



Qy 



2 00 EPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSF 2 59 




Db 



258 EPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSF 317 



Qy 



260 FLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 302 




Db 



318 FLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 360 



RESULT 3 

US-09-180-100-22 

; Sequence 22, Application US/09180100 
; Patent No. 6306395 
; GENERAL INFORMATION: 

; APPLICANT: NAKAMURA, No. 6306395io 
; APPLICANT: NAGATA, Shigekazu 

; TITLE OF INVENTION: NOVEL Fas ANTIGEN DERIVATIVE 
; FILE REFERENCE: 1110-207P 

; CURRENT APPLICATION NUMBER: US/09/ 18 0 , 100 

; CURRENT FILING DATE: 1998-11-02 

; EARLIER APPLICATION NUMBER: PCT/JP97 / 01502 

; EARLIER FILING DATE: 1997-05-01 

; NUMBER OF SEQ ID NOS : 25 

; SOFTWARE: PatentlnVer. 2.0 

; SEQ ID NO 22 

LENGTH: 37 6 

TYPE: PRT 
; ORGANISM: Homo sapiens 
US-09-180-100-22 

Query Match 76.5%; Score 1257.5; DB 4; Length 376; 

Best Local Similarity 84.8%; Pred. No. 6.3e-112; 

Matches 240; Conservative 8; Mismatches 20; Indels 15; Gaps 4 

Qy 30 QCSQNEYFDS-LLHACIPC QLRCSSNTPPLTCQRYCNASVTNSVKGVDKTH 79 

: I | : : | : III : : | | | I : I : I I I I I I 

Db 99 RCKPNFFCNSTVCEHCDPCTKCEHGIIKECTLTSNT KCKE — EGSRSNEPKSCDKTH 153 

Qy 80 TCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVWDVSHEDPEVKFNWYVDGVEV 139 

I I I I I I I II I I I II I II I I I II I I I I I I I I M I I I I I I I I I I I I I I I I I I I I II I I M M 
Db 154 TCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVWDVSHEDPEVKFNWYVDGVEV 213 

Qy 140 HNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPR 199 



Db 214 HN7VKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPR 273 

Qy 200 EPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSF 259 

I M | I I I I | I I I I I I I I I II I I I II I I I I I I I I I I I I I I I M I II II I I I I I I I I I I I I I 
Db 274 EPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSF 333 

Qy 260 FLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 302 

I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I 
Db 334 FLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 376 



RESULT 4 
US-08-470-299-4 

Sequence 4, Application US/08470299 
Patent No. 5783181 
GENERAL INFORMATION: 

APPLICANT: Browne, Michael J. 
APPLICANT: Murphy, Kay E. 
APPLICANT: Chapman, Conrad G. 
APPLICANT: Clinkenbeard, Helen E. 
APPLICANT: Young, Peter R. 
APPLICANT: Shatzman, Allan R. 

TITLE OF INVENTION: No. 5783181el Compounds 
NUMBER OF SEQUENCES: 21 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: SmithKline Beecham Corporation 
STREET: 709 Swedeland Road, P.O. Box 1539 
CITY: King of Prussia 
STATE : Pennsylvania 
COUNTRY: USA 
ZIP: 19406 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.30 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/470 , 299 
FILING DATE: 06-JUN-1995 
CLASSIFICATION: 435 
ATTORNEY/AGENT INFORMATION: 
NAME: Sutton, Jeffrey A. 
REGISTRATION NUMBER: 34,028 
REFERENCE/DOCKET NUMBER: P31005C3 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 610-2 7 0-5024 
TELEFAX: 610-270-50 90 
INFORMATION FOR SEQ ID NO: 4: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 387 amino acids 
TYPE: amino acid 
STRANDEDNESS: 
TOPOLOGY: linear 
US-08-470-299-4 



Query Match 



76.1%; Score 1250.5; DB 1; Length 387; 



Best Local Similarity 92.1%; Pred. No. 3.1e-lll; 

Matches 233; Conservative 1; Mismatches 4; Indels 15; Gaps 1 

Qy 50 RCSSNTPPLTCQRYCNASVTNSVKGVDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMIS 109 

: I I I I I I II I I I I I I I I I II I I I I I I I I I I I I I I I I II I I I 

Db 150 KCSSGTEP KSADKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMIS 194 

Qy 110 RTPEVTCVWDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWL 169 

I I I I II I I I I I I I I II I I II I I I I I I I I I I I I I I I I I II I II II I I I I I I I I I I I I I I I I 
Db 195 RT PEVT C VWDVSHED P EVKFNWYVDGVEVHNAKTKP REEQ YN S T YRWS VLT VLHQDWL 254 

Qy 170 NGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYP 229 

I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I II II I I I I I I I I I I I I I I I M I I I 
Db 255 NGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYP 314 

Qy 230 SDIAVEWESNGQPENNYKTT P PVLDS DGS FFL YS KLTVDKSRWQQGNVFS CSVMHEALHN 289 

I I I I M I I I I I I I I I I I I I I II I I I II I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I 
Db 315 SDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHN 374 

Qy 290 HYTQKSLSLSPGK 302 

I I I I I I I I I I I I I 
Db 375 HYTQKSLSLSPGK 387 



RESULT 5 
US-08-236-311-7 

; Sequence 7, Application US/08236311 
; Patent No. 5565335 

GENERAL INFORMATION: 

APPLICANT: Capon, Daniel J. 
APPLICANT: Gregory, Timothy J. 
TITLE OF INVENTION: Adheson Variants 
NUMBER OF SEQUENCES: 25 
CORRESPONDENCE ADDRESS: 
; ADDRESSEE: Genentech, Inc. 

; STREET: 460 Point San Bruno Blvd 

CITY: South San Francisco 
; STATE: California 

COUNTRY: USA 
ZIP: 94080 
COMPUTER READABLE FORM: 

MEDIUM TYPE: 5.25 inch, 360 Kb floppy disk 
; COMPUTER: IBM PC compatible 

; OPERATING SYSTEM: PC-DOS/MS-DOS 

; SOFTWARE: patin (Genentech) 

; CURRENT APPLICATION DATA: 

; APPLICATION NUMBER: US/ 08 /23 6 , 3 11 

FILING DATE: 02-MAY-1994 
CLASSIFICATION: 435 
PRIOR APPLICATION DATA: 
; APPLICATION NUMBER: 07/936190 

FILING DATE: 26-AUG-1992 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 07/842777 
FILING DATE: 18-FEB-1992 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 07/250785 



FILING DATE: 28-SEP-1988 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 07/104329 
FILING DATE: 02-OCT-1987 
ATTORNEY/AGENT INFORMATION: 
NAME: Hasak, Janet E. 
REGISTRATION NUMBER: 28,616 
REFERENCE/DOCKET NUMBER: 444P1C2 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 415/225-18 96 
TELEFAX: 415/952-98 81 
TELEX: 910/371-7168 
INFORMATION FOR SEQ ID NO: 7: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 371 amino acids 
TYPE: amino acid 
TOPOLOGY: linear 
US-08-236-311-7 

Query Match 75.7%; Score 1243; DB 1; Length 371; 

Best Local Similarity 92.1%; Pred. No. 1.5e-110; 

Matches 234; Conservative 1; Mismatches 11; Indels 8; Gaps 2 

Qy 51 CSSNTPPLTCQRYCNASVTNSV — KGVDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMI 108 

I : I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I 

Db 124 CNVNHKP SNTKVDKKVEPKSCDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMI 177 

Qy 109 SRTPEVTCVVVDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDW 168 

I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I II I I I I 
Db 178 SRTPEVTCVWDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDW 237 

Qy 169 LNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFY 22 8 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I II I I I I I I I I I I I I I I I I I I I I 
Db 238 LNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFY 2 97 

Qy 22 9 PSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALH 2 88 

I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I II I II I I I I I I I I I II I I I I 
Db 298 PSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALH 357 

Qy 289 NHYTQKSLSLSPGK 302 

I I I I I I I I I I I I I I 
Db 358 NHYTQKSLSLSPGK 371 



RESULT 6 
US-08-457-918-7 

; Sequence 7, Application US/08457918 
; Patent No. 6117655 
; GENERAL INFORMATION: 

APPLICANT : Capon, Daniel J. 

APPLICANT: Gregory, Timothy J. 
; TITLE OF INVENTION: Adheson Variants 

NUMBER OF SEQUENCES: 25 

CORRESPONDENCE ADDRESS: 
; ADDRESSEE: Genentech, Inc. 

; STREET: 4 60 Point San Bruno Blvd 

; CITY: South San Francisco 



STATE: California 
; COUNTRY: USA 

; ZIP: 94080 

COMPUTER READABLE FORM: 

MEDIUM TYPE: 5.25 inch, 360 Kb floppy disk 

COMPUTER: IBM PC compatible 
; OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: patin (Genentech) 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/ 0 8 /457 , 918 

FILING DATE: l-JUN-1995 

CLASSIFICATION: 435 
; PRIOR APPLICATION DATA: 

; APPLICATION NUMBER: 08/236311 

FILING DATE: 02-MAY-1994 
PRIOR APPLICATION DATA: 
; APPLICATION NUMBER: 07/936190 

FILING DATE: 2 6-AUG-19 92 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 07/842777 

FILING DATE: 18-FEB-1992 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 07/250785 

FILING DATE: 28-SEP-1988 
; PRIOR APPLICATION DATA: 

; APPLICATION NUMBER: 07/104329 

FILING DATE: 02-OCT-1987 
; ATTORNEY/AGENT INFORMATION: 

; NAME: Kubinec, Jeffrey S. 

REGISTRATION NUMBER: 36,575 

REFERENCE/DOCKET NUMBER: P0444P1C3 
; TELECOMMUNICATION INFORMATION: 

TELEPHONE: 415/225-8228 

TELEFAX: 415/952-9881 
; TELEX: 910/371-7168 

; INFORMATION FOR SEQ ID NO: 7: 
; SEQUENCE CHARACTERISTICS: 

LENGTH: 371 amino acids 
; TYPE: amino acid 

; TOPOLOGY: linear 

US-08-457-918-7 

Query Match 75.7%; Score 1243; DB 3; Length 371; 

Best Local Similarity 92.1%; Pred. No. 1.5e-110; 

Matches 234; Conservative 1; Mismatches 11; Indels 8; Gaps 2; 

Qy 51 CSSNTPPLTCQRYCNASVTNSV — KGVDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMI 108 

I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 124 CNVNHKP SNTKVDKKVEPKSCDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMI 177 

Qy 109 SRTPEVTCVWDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDW 168 



Db 



178 




237 



Qy 



169 



LNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFY 



228 



Db 



238 




297 



229 



PSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALH 2 88 




Db 



298 



PSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALH 357 



Qy 



289 



NHYTQKSLSLSPGK 302 




Db 



358 



NHYTQKSLSLSPGK 371 



RESULT 7 
US-08-397-411-7 

; Sequence 7 , Application US/08397411 

; Patent No. 6129914 

; GENERAL INFORMATION: 

; APPLICANT: Weiner, George 

APPLICANT: Gingrich, Roger 
; APPLICANT: Link, Brian 
; APPLICANT: Tso, J. Yun 

TITLE OF INVENTION: Bispecific Antibody Effective to Treat 
; TITLE OF INVENTION: B-Cell Lymphoma and Cell Line 
; NUMBER OF SEQUENCES : 14 

; CORRESPONDENCE ADDRESS: 

; ADDRESSEE: Townsend and Townsend and Crew 

; STREET: One Market Plaza, Steuart Tower, Suite 2000 

; CITY: San Francisco 

; STATE: California 

COUNTRY: USA 
; ZIP: 94105 

COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.25 
; CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/ 0 8 /3 97 , 4 11 
; FILING DATE: 01-MAR-1995 

; CLASSIFICATION: 424 

; PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 07/859,583 

FILING DATE: 2 7 -MAR- 19 92 
ATTORNEY/AGENT INFORMATION: 
; NAME: Smith, William M. 

; REGISTRATION NUMBER: 30,223 

; REFERENCE/DOCKET NUMBER: 011823-004901 

; TELECOMMUNICATION INFORMATION: 

TELEPHONE: 415-326-2400 

TELEFAX: 415-326-2422 
; INFORMATION FOR SEQ ID NO: 7: 
; SEQUENCE CHARACTERISTICS: 

; LENGTH: 44 6 amino acids 

; TYPE: amino acid 

STRANDEDNESS : single 

TOPOLOGY: linear 
; MOLECULE TYPE: peptide 
US-08-397-411-7 



Query Match 75.7%; Score 1243; DB 3; Length 446; 

Best Local Similarity 92.1%; Pred. No. 2e-110; 

Matches 234; Conservative 1; Mismatches 11; Indels 8; Gaps 2 



Qy 



51 CSSNTPPLTCQRYCNASVTNSV — KGVDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMI 10 8 



Db 




Qy 



109 SRTPEVTCVWDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDW 168 




Db 



253 SRTPEVTCVWDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDW 312 



Qy 



169 LNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFY 22 8 




Db 



313 LNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFY 372 



Qy 



22 9 PSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALH 2 8 8 




Db 



37 3 PSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALH 432 



Qy 



289 NHYTQKSLSLSPGK 302 




Db 



433 NHYTQKSLSLSPGK 446 



RESULT 8 

US-08-458-516-13 

; Sequence 13, Application US/08458516 
; Patent No. 5777085 
; GENERAL INFORMATION: 

APPLICANT: Co, Man Sung 

APPLICANT: Tso, J. Yun 
; TITLE OF INVENTION: Humanized Antibodies Reactive with 
; TITLE OF INVENTION: GPIIB/IIIA 

; NUMBER OF SEQUENCES: 23 

; CORRESPONDENCE ADDRESS: 

; ADDRESSEE: William M. Smith 

STREET: One Market Plaza, Steuart Tower, Suite 2000 
CITY: San Francisco 
; STATE: California 

; COUNTRY: USA 

; ZIP: 94105 

; COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
; COMPUTER: IBM PC compatible 

; OPERATING SYSTEM: PC-DOS/MS-DOS 

; SOFTWARE: Patentln Release #1.0, Version #1.25 

; CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/458,516 

FILING DATE: 
; CLASSIFICATION: 424 

; PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 08/059,159 

FILING DATE: 03-MAY-1993 
; ATTORNEY/AGENT INFORMATION: 

NAME: Smith, William M. 
; REGISTRATION NUMBER: 30,223 



REFERENCE/ DOCKET NUMBER: 11823-37-3 
TELECOMMUNICATION INFORMATION: 
TELEPHONE : 415-32 6-24 00 
TELEFAX: 415-32 6-2422 
INFORMATION FOR SEQ ID NO: 13: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 449 amino acids 
TYPE: amino acid 
STRANDEDNESS : single 
TOPOLOGY: linear 
MOLECULE TYPE: protein 
US-08-458-516-13 

Query Match 75.7%; Score 1243; DB 1; Length 449; 

Best Local Similarity 92.1%; Pred. No. 2e-110; 

Matches 234; Conservative 1; Mismatches 11; Indels 8; Gaps 2 

Qy 51 CSSNTPPLTCQRYCNASVTNSV — KGVDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMI 108 

I : I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I 

Db 2 02 CNVNHKP SNTKVDKKVEPKSCDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMI 255 

Qy 109 SRTPEVTCVWDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDW 168 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I II I I I I I I I I I I I I I I I I I I I I 
Db 256 SRTPEVTCVWDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDW 315 

Qy 169 LNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFY 228 

I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I 
Db 316 LNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFY 375 

Qy 229 P S DI AVEWESNGQPENNYKTT P PVLD SDGS FFLYS KLTVDKSRWQQGNVFS CS VMHEALH 288 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I II I II I I I I I I I I I I I I II II 
Db 376 PSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALH 435 

Qy 289 NHYTQKSLSLSPGK 302 

I I II I I I I I II I I I 
Db 436 NHYTQKSLSLSPGK 449 



RESULT 9 

US-08-157-101A-7 

Sequence 7, Application US/08157101A 
Patent No. 5808032 
GENERAL INFORMATION: 

APPLICANT: KURIHARA, TATSUYA 
APPLICANT: MATSUKURA, SHIGEKAZU 
APPLICANT: TSURUOKA, NOBUO 
APPLICANT: ARIMA, KENJI 
APPLICANT: NISHIHARA, TATSURO 

TITLE OF INVENTION: ANTI-HBs ANTIBODY GENES AND EXPRESSION 
TITLE OF INVENTION: PLASMIDS THEREFOR 
NUMBER OF SEQUENCES: 9 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: PILLSBURY, MADISON & SUTRO 
STREET: 1100 NEW YORK AVENUE, N.W. 
CITY: WASHINGTON 
STATE: D.C. 
COUNTRY: USA 



ZIP : 20005 
; COMPUTER READABLE FORM: 

; MEDIUM TYPE: Floppy disk 

; COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.25 
; CURRENT APPLICATION DATA: 

; APPLICATION NUMBER: US/ 0 8/ 157 , 1 01A 

; FILING DATE: 05-APR-1994 

CLASSIFICATION: 530 
ATTORNEY/AGENT INFORMATION: 

NAME : TITUS, MARIANA K 

REGISTRATION NUMBER: 35843 

REFERENCE/DOCKET NUMBER: 9437/204199 
TELECOMMUNICATION INFORMATION: 
; TELEPHONE: 202-861-3711 

TELEFAX: 202-822-0944 
; TELEX: 6714627 CUCH 

; INFORMATION FOR SEQ ID NO: 7: 
SEQUENCE CHARACTERISTICS : 

LENGTH: 459 amino acids 

TYPE: amino acid 

STRANDEDNESS: single 

TOPOLOGY: linear 
MOLECULE TYPE: peptide 
US-08-157-101A-7 

Query Match 75.7%; Score 1243; DB 1; Length 459; 

Best Local Similarity 92.1%; Pred. No. 2.1e-110; 

Matches 234; Conservative 1; Mismatches 11; Indels 8; Gaps 2 

Qy 51 CSSNTPPLTCQRYCNASVTNSV — KGVDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMI 108 

I : I I I I I I I I I I I I I I I I I I I I I I I I I I II I I II II I I I I I 

Db 212 CNVNHKP SNTKVDKKVEPKSCDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMI 2 65 

Qy 109 S RT P E VT CVWDVS H E D P EVK FNW YVDGVEVHNAKT KP REEQ YN S T YRWS VLT VLHQDW 168 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I 
Db 2 66 SRTPEVTCVWDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDW 325 

Qy 169 LNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFY 228 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I 
Db 326 LNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFY 385 

Qy 229 PSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALH 288 

I I I M II I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I II I I I I M I I I I I I 

Db 386 PSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALH 445 

Qy 289 NHYTQKSLSLSPGK 302 

I I I I I I M I I I I I I 
Db 446 NHYTQKSLSLSPGK 459 



RESULT 10 
US-08-378-939-10 

; Sequence 10, Application US/08378939 
; Patent No. 5876961 
; GENERAL INFORMATION: 



APPLICANT: CROWE, JAMES SCOTT 
APPLICANT: LEWIS, ALAN PETER 

TITLE OF INVENTION: PRODUCTION OF ANTIBODIES 
NUMBER OF SEQUENCES: 4 6 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: ROTHWELL, FIGG, ERNST & KURZ 
STREET: 555 THIRTEENTH ST. N.W. 
CITY: WASHINGTON 
STATE: D. C. 
COUNTRY: U.S. 
ZIP: 20004 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.25 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/ 0 8/37 8 , 939 
FILING DATE: 
CLASSIFICATION: 435 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 07/952640 
FILING DATE: 01-DEC-1992 
ATTORNEY/AGENT INFORMATION: 
NAME: ERNST, BARBARA G 
REGISTRATION NUMBER: 30,377 
REFERENCE/ DOCKET NUMBER: 1808-118 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (202) 783-6040 
TELEFAX: (202) 783-6031 
INFORMATION FOR SEQ ID NO: 10: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 476 amino acids 
TYPE: amino acid 
TOPOLOGY: linear 
MOLECULE TYPE: protein 
US-08-378-939-10 

Query Match 75.7%; Score 1243; DB 2; Length 476; 

Best Local Similarity 92.1%; Pred. No. 2.2e-110; 

Matches 234; Conservative 1; Mismatches 11; Indels 8; Gaps 2 

Qy 51 CSSNTPPLTCQRYCNASVTNSV — KGVDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMI 108 

I : I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I 

Db 22 9 CNVNHKP SNTKVDKKVEPKSCDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMI 2 82 

Qy 109 SRTPEVTCVWDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDW 168 

I I I I I I I I II I I I I I I I I I I I I I I I I I I I II I I II I I I I I I I I I I I I I I I I I I I M I I I I 

Db 2 83 SRTPEVTCVWDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDW 342 

Qy 169 LNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFY 228 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 343 LNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFY 402 

Qy 229 PSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALH 288 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 403 PSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALH 462 



Qy 289 NHYTQKSLSLSPGK 302 

I I I I I I I I I I I I I I 
Db 463 NHYTQKSLSLSPGK 476 



RESULT 11 
US-09-746-359A-54 

Sequence 54, Application US/09746359A 
Patent No. 6610286 
GENERAL INFORMATION: 
APPLICANT: Thompson, Penny 
APPLICANT: Foster, Donald C. 
APPLICANT : Xu, Wenf eng 
APPLICANT: Madden, Karen L. 
APPLICANT: Kelly, James D. 
APPLICANT : Sprecher, Cindy A. 
APPLICANT: Blumberg, Hal 
APPLICANT: Eagan, Maribeth A. 
APPLICANT: Jaspers, Stephen R. 
APPLICANT: Chandrasekher , Yasmin A. 
APPLICANT: No. 6610286ak, Julia E. 

TITLE OF INVENTION: Method for Treating Inflammation 
FILE REFERENCE: 99-108 

CURRENT APPLICATION NUMBER: US/ 0 9/74 6 , 359A 
CURRENT FILING DATE: 2001-05-21 
PRIOR APPLICATION NUMBER: 60/171,969 
PRIOR FILING DATE: 1999-12-23 
PRIOR APPLICATION NUMBER: 60/213,341 
PRIOR FILING DATE: 2000-06-22 
NUMBER OF SEQ ID NOS : 72 

SOFTWARE: FastSEQ for Windows Version 3.0 
SEQ ID NO 54 
LENGTH: 547 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-09-746-359A-54 



Query Match 75.7%; 
Best Local Similarity 92.1%; 
Matches 234; Conservative 



Score 1243; DB 4; Length 547; 
Pred. No. 2.7e-110; 
1; Mismatches 11; Indels ! 



Gaps 



Qy 



Db 



51 CSSNTPPLTCQRYCNASVTNSV — KGVDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMI 10 8 

I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

300 CNVNHKP SNTKVDKKVEPKSCDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMI 353 



Qy 

Db 

Qy 

Db 

Qy 

Db 



109 SRTPEVTCVWDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLTVLHQDW 168 

I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
354 SRTPEVTCVWDVSHEDPEVKFNWYVDGVEVHN7VKTKPREEQYNSTYRWSVLTVLHQDW 413 



169 



228 



LNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFY 

I I I I I I 1 I I I I I I I I I I I I I I I I II I I II I I I I I I 1 I I I I II I I I I I I I I I I I I I I I I I I 
414 LNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFY 473 

22 9 PSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALH 288 

I I I M I I I II I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I 
47 4 PSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALH 533 



Qy 289 NHYTQKSLSLSPGK 302 

I I I I I II I I I I II I 
Db 534 NHYTQKSLSLSPGK 547 



RESULT 12 
US-09-746-359A-53 

Sequence 53, Application US/09746359A 
Patent No. 6610286 
GENERAL INFORMATION: 
APPLICANT : Thompson, Penny 
APPLICANT: Foster, Donald C. 
APPLICANT: Xu, Wenfeng 
APPLICANT: Madden, Karen L. 
APPLICANT: Kelly, James D. 
APPLICANT: Sprecher, Cindy A. 
APPLICANT: Blumberg, Hal 
APPLICANT: Eagan, Maribeth A. 
APPLICANT: Jaspers, Stephen R. 
APPLICANT: Chandrasekher , Yasmin A. 
APPLICANT: No. 6610286ak, Julia E. 

TITLE OF INVENTION: Method for Treating Inflammation 
FILE REFERENCE: 99-108 

CURRENT APPLICATION NUMBER: US/ 09/7 4 6 , 35 9A 
CURRENT FILING DATE: 2001-05-21 
PRIOR APPLICATION NUMBER: 60/171,969 
PRIOR FILING DATE: 1999-12-23 
PRIOR APPLICATION NUMBER: 60/213,341 
PRIOR FILING DATE: 2000-06-22 
NUMBER OF SEQ ID NOS: 72 

SOFTWARE: FastSEQ for Windows Version 3.0 
SEQ ID NO 53 
LENGTH: 571 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-09-746-359A-53 



Query Match 75.7%; 
Best Local Similarity 92.1%; 
Matches 234; Conservative 



Score 1243; DB 4; Length 571; 
Pred. No. 2.8e-110; 
1; Mismatches 11; Indels I 



Gaps 



2; 



Qy 



Db 



51 CSSNTPPLTCQRYCNASVTNSV — KGVDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMI 108 

I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I 

324 CNVNHKP SNTKVDKKVEPKSCDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMI 377 



QY 
Db 

Qy 

Db 

Qy 

Db 



109 SRTPEVTCVWDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDW 168 

I I I I I I I I I M I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
37 8 SRTPEVTCVWDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDW 437 



169 



228 



LNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFY 
I I I I I I I I I I I II I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I 
438 LNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFY 497 

22 9 PSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALH 2 8 8 

I I I I I I I II I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I II I I I I I I I I I I II II I I I 
4 98 PSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALH 5 57 



Qy 289 NHYTQKSLSLSPGK 302 

I I I I I II I I I I I I I 
Db 558 NHYTQKSLSLSPGK 571 



RESULT 13 
US-09-313-942-9 

Sequence 9, Application US/09313942 
Patent No. 6472179 
GENERAL INFORMATION: 
APPLICANT: REGENERON PHARMACEUTICALS, INC. 

TITLE OF INVENTION: RECEPTOR BASED ANTAGONISTS , AND METHODS OF MAKING 
TITLE OF INVENTION: AND USING 
FILE REFERENCE: REG 2 03-A 

CURRENT APPLICATION NUMBER: US/09/313, 942 
CURRENT FILING DATE: 1999-05-19 
PRIOR APPLICATION NUMBER: 09/313,942 
PRIOR FILING DATE: 1999-05-19 
PRIOR APPLICATION NUMBER: 60/101,858 
PRIOR FILING DATE: 1998-09-25 
NUMBER OF SEQ ID NOS : 32 

SOFTWARE: FastSEQ for Windows Version 3.0 
SEQ ID NO 9 
LENGTH: 951 
TYPE: PRT 
; ORGANISM: Homo sapiens 
US-09-313-942-9 

Query Match 75.7%; Score 1243; DB 4; Length 951; 

Best Local Similarity 92.1%; Pred. No. 6e-110; 

Matches 2 34; Conservative 1; Mismatches 11; Indels 8; Gaps 2 
Qy 51 CSSNTPPLTCQRYCNASVTNSV— KGVDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMI 108 

hi I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I 

Db 7 04 CNVNHKP SNTKVDKKVEPKSCDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMI 757 

Qy 109 SRTPEVTCVWDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDW 168 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 758 SRTPEVTCVWDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDW 817 

Qy 169 LNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFY 228 

I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I 1 I I II I I I I I I I 
Db 818 LNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFY 877 

Qy 22 9 PSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALH 28 8 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I II I I I I I I I I I I I I I I 
Db 87 8 PSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALH 937 



Qy 289 NHYTQKSLSLSPGK 302 

I I I I I I I I I I I I I I 
Db 938 NHYTQKSLSLSPGK 951 



RESULT 14 
US-09-178-869-2 

; Sequence 2, Application US/09178869B 



Patent No. 6197294 
GENERAL INFORMATION: 



APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 



Tao, Weng 
Wong, Shou 
Hickey, William F 
Hammang, Joseph P. 
Baetge, E . Edward 
TITLE OF INVENTION: CELL SURFACE- INDUCED MACROPHAGE ACTIVATION 
FILE REFERENCE: 17810-043 

CURRENT APPLICATION NUMBER: US/09/178, 869B 
CURRENT FILING DATE: 1998-10-26 
NUMBER OF SEQ ID NOS : 14 
SOFTWARE: PatentlnVer. 2.0 
SEQ ID NO 2 
LENGTH: 331 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-09-178-869-2 

Query Match 75.5%; Score 1241; DB 3; Length 331; 

Best Local Similarity 78.7%; Pred. No. 2e-110; 

Matches 24 0; Conservative 9; Mismatches 30; Indels 26; Gaps 3 

Qy 2 0 GDVTMLQMAGQCSQNEYFDSLLHACIPCQLRCSSNTPPLTCQ 61 

I I : : : I : I I : I : III: I 
Db 31 GDNSHVEMKLAVDEEENADNNTKANVTKPKRCSGS 1 C YGT I AVI VFFL I GFMI GYL 8 6 

Qy 62 RYCNA SVTNSVKGVDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCV 117 

II : : I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I 

Db 87 GYCKGVEPKTEGSEPKSCDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCV 14 6 

Qy 118 WDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCK 177 

I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I 
Db 147 WDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCK 206 

Qy 178 VSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWE 237 

I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I 
Db 207 VSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWE 266 

Qy 238 SNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLS 297 

! I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 267 SNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLS 326 

Qy 298 LSPGK 302 

I I I I I 

Db 327 LSPGK 331 



RESULT 15 
US-09-761-413-2 

; Sequence 2, Application US/09761413 

; Patent No. 6506891 

; GENERAL INFORMATION: 

; APPLICANT: Tao, Weng 

; APPLICANT: Wong, Shou 

; APPLICANT: Hickey, William F 

; APPLICANT: Hammang, Joseph P. 



APPLICANT: Baetge, E. Edward 

TITLE OF INVENTION: CELL SURFACE-INDUCED MACROPHAGE ACTIVATION 
FILE REFERENCE: 17810-043 

CURRENT APPLICATION NUMBER: US/ 09/7 61, 4 13 
CURRENT FILING DATE: 2001-01-16 
PRIOR APPLICATION NUMBER: US/ 09/ 17 8 , 8 69 
PRIOR FILING DATE: 1998-10-26 
NUMBER OF SEQ ID NOS : 14 
SOFTWARE: Patentln Ver. 2.0 
SEQ ID NO 2 
LENGTH: 331 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-09-761-413-2 

Query Match 75.5%; Score 1241; DB 4; Length 331; 

Best Local Similarity 78.7%; Pred. No. 2e~110; 

Matches 240; Conservative 9; Mismatches 30; Indels 26; Gaps 3 

Qy 2 0 GDVTMLQMAGQCSQNEYFDSLLHACIPCQLRCSSNTPPLTCQ 61 

I I : : : I : I I : I : III: I 

Db 31 GDNSHVEMKLAVDEEENADNNTKANVTKPKRCSGS 1 C YGT I AVI VFFL I GFMI GYL 8 6 

Qy 62 RYCNA SVTNSVKGVDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCV 117 

II : : I I I I I I I I I I I I I I II I I I I i I I I I I I I I I I I II I I II II I I I 

Db 87 GYCKGVEPKTEGSEPKSCDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCV 14 6 

Qy 118 WDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCK 177 

II I I I I I I I I I I M I I I I I I I I I I I I I I I II I I I I I I I I I I I I I II I I I I I I I I I I I I I I 

Db 147 WDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCK 2 06 

Qy 178 VSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWE 237 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 207 VSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWE 266 

Qy 238 SNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLS 297 

I I I I I I I I I I I I I I II I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I 
Db 267 SNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLS 326 

Qy 298 LSPGK 302 

I I 1 I I 

Db 327 LSPGK 331 



Search completed: April 19, 2004, 13:24:21 
Job time : 21.5062 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 
Run on: 



Title: 

Perfect score: 
Sequence : 

Scoring table: 



April 19, 2004, 13:18:32 ; Search time 16.7778 Seconds 

(without alignments) 
1731.447 Million cell updates/sec 

PCT-US03-05147-3 
1643 

1 METDTLLLWVLLLWVPGSTG MHEALHNHYTQKSLSLSPGK 302 

BLOSUM62 

Gapop 10.0 , Gapext 0.5 



Searched: 283366 seqs, 96191526 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 10 0% 
Listing first 45 summaries 



283366 



Database 



PIRJ78 : * 
pirl : * 
pir2 : * 
pir3 : * 
pir4 : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



Result 



Query 



No. 


Score 


Match 


Length 


DB 


ID 


Description 


1 


1243 


75. 


7 


330 


1 


GHHU 


ig 


gamma- 1 chain C 


2 


1237 . 5 


75. 


3 


374 


2 


S69339 


ig 


heavy chain V r 


3 


1237 


75 


3 


255 


4 


S31866 


ig 


gamma- 1 chain C 


4 


1186 


72 


2 


234 


2 


PT0207 


ig 


gamma chain C r 


5 


1163 


70 


8 


377 


2 


A23511 


ig 


gamma-3 chain C 


6 


1161 


70 


7 


377 


2 


A60764 


ig 


gamma-3 chain C 


7 


1153 


70 


2 


326 


1 


G2HU 


ig 


gamma-2 chain C 


8 


1142 


69 


5 


327 


1 


G4HU 


ig 


gamma-4 chain C 


9 


1138.5 


69 


3 


289 


1 


G3HUWI 


ig 


gamma-3 heavy c 


10 


934.5 


56 


9 


323 


1 


GHRB 


ig 


gamma chain C r 


11 


918.5 


55 


. 9 


328 


2 


147159 


ig 


gamma 2a chain 


12 


917 


55 


. 8 


328 


2 


147160 


ig 


gamma 2b chain 


13 


915 


55 


.7 


277 


2 


147162 


ig 


gamma 4 chain c 



14 


903.5 


55. 


0 


329 


1 


G2GP 


ig 


gamma-2 chain C 


15 


898 . 5 


54 . 


7 


328 


2 


147158 


ig 


gamma 1 chain c 


16 


891.5 


54. 


3 


328 


2 


147161 


ig 


gamma 3 chain c 


17 


870.5 


53. 


0 


470 


2 


S22080 


Ig heavy chain pre 


18 


856 


52. 


1 


329 


1 


G3MSC 


ig 


gamma- 3 chain C 


19 


854 


52. 


0 


308 


2 


C30554 


ig 


heavy chain C r 


20 


854 


52. 


0 


472 


2 


S31459 


ig 


gamma- 1 chain - 


21 


849 


51. 


7 


333 


2 


PS0018 


ig 


gamma- 2b chain 


22 


845 


51. 


4 


398 


1 


G3MSM 


ig 


gamma- 3 chain C 


23 


836.5 


50. 


9 


444 


2 


PC4436 


monoclonal antibod 


24 


827 


50. 


3 


326 


2 


PS0017 


ig 


gamma-1 chain C 


25 


826 


50. 


3 


329 


2 


S00847 


ig 


gamma-2 c chain 


26 


825.5 


50. 


2 


324 


1 


G1MS 


ig 


gamma-1 chain C 


27 


820.5 


49. 


9 


393 


1 


G1MSM 


ig 


gamma-1 chain C 


28 


819.5 


49. 


9 


330 


1 


G2MSA 


ig 


gamma-2a chain 


29 


819.5 


49. 


9 


469 


2 


S37483 


ig 


gamma-2a chain 


30 


818.5 


49. 


8 


335 


1 


G2MSAB 


ig 


gamma-2a chain 


31 


814.5 


49. 


6 


399 


1 


G2MSAM 


ig 


gamma-2a chain 


32 


804.5 


49. 


0 


446 


2 


S40295 


ig 


gamma- 2 a chain 


33 


796 


48. 


4 


322 


2 


PS0019 


ig 


gamma-2 a chain 


34 


792 


48. 


2 


474 


1 


G2MS11 


ig 


gamma-2b chain 


35 


785 


47. 


8 


327 


2 


S06611 


ig 


gamma-2 chain C 


36 


784.5 


47. 


7 


405 


1 


G2MSBM 


ig 


gamma-2b chain 


37 


767.5 


46. 


7 


475 


2 


S01321 


ig 


gamma-2b chain 


38 


707 


43. 


0 


180 


2 


146732 


ig 


gamma heavy cha 


39 


577.5 


35. 


1 


249 


2 


S69340 


ig 


heavy chain VHI 


40 


574.5 


35. 


0 


218 


2 


A36040 


ig 


heavy chain V-I 


41 


571 


34. 


8 


152 


2 


S14236 


ig 


gamma-1 chain C 


42 


400 


24 . 


3 


572 


2 


B46529 


ig 


Y heavy chain ( 


43 


364.5 


22. 


2 


549 


2 


S04845 


ig 


heavy chain pre 


44 


362 . 5 


22 . 


1 


388 


1 


EHMS 


ig 


epsilon chain C 


45 


359.5 


21. 


9 


548 


2 


S38864 


ig 


epsilon chain C 



ALIGNMENTS 



RESULT 1 
GHHU 

Ig gamma-1 chain C region - human 
C; Species: Homo sapiens (man) 

C;Date: 31-Jan-1981 #sequence_revision 18-Aug-1982 #text__change 16-Jul-1999 
C;Accession: A93433; S36861; S33887; B90563; A90564; B91668; A91723; A02146 
R;Ellison, J.W. ; Berson, B.J.; Hood, L.E. 
Nucleic Acids Res. 10, 4071-4079, 1982 

A; Title: The nucleotide sequence of a human immunoglobulin C-gammal gene. 

A; Reference number: A93433; MUID : 82274238 ; PMID: 6287432 

A; Access ion : A93433 

A;Molecule type: DNA 

A; Residues: 1-330 <ELL> 

A;Cross-references : EMBL: Z17370 

A;Note: this sequence has the Glm(17) allotypic marker, 97-Lys, and the Glm(l) 
markers, 239-Asp and 241-Leu 

A;Note: Lys-330 is removed after translation 
R;Harris, L.J. 

submitted to the EMBL Data Library, October 1992 
A;Reference number: S33904 



A; Accession: S3 68 61 

A;Molecule type: DNA 

A; Residues: 2-330 <HAR> 

A; Cross-references : EMBL: Z 1737 0 

R;Takahashi, N.; Ueda, S.; Obata, M. ; Nikaido, T . ; Nakai, S.; Honjo, T. 
Cell 29, 671-679, 1982 

A; Title: Structure of human immunoglobulin gamma genes: implications for 
evolution of a gene family. 

A;Reference number: S33887; MUID : 83001943; PMID:6811139 
A;Accession: S33887 
A;Molecule type: DNA 
A;Residues: 88-113 ; 235-330 <TAK> 
A;Cross-references : EMBL : Z 17370 

R;Cunningham, B.A.; Rutishauser, U.; Gall, W.E.; Gottlieb, P.D.; Waxdal, M.J.; 
Edelman, G.M. 

Biochemistry 9, 3161-3170, 1970 

A; Title: The covalent structure of a human gammaG- immunoglobulin . VII. Amino 

acid sequence of heavy-chain cyanogen bromide fragments H-l-H-4. 

A;Reference number: A90563; MUID : 7106402 4 ; PMID:5489771 

A; Contents: myeloma protein Eu 

A;Accession: B90563 

A;Molecule type: protein 

A; Residues: 1-96, 1 R ' , 98-135 <CUN> 

A;Note: this sequence has the Glm(3) marker, 97-Arg 

R;Rutishauser, U.; Cunningham, B.A. ; Bennett, C; Konigsberg, W.H.; Edelman, 
G.M. 

Biochemistry 9, 3171-3181, 1970 

A;Title: The covalent structure of a human gammaG-immunoglobulin . VIII. Amino 
acid sequence of heavy-chain cyanogen bromide fragments H-5-H-7. 
A;Reference number: A90564; MUID : 71064025; PMID:5530842 
A; Contents: Eu 
A;Accession: A90564 
A;Molecule type: protein 

A; Residues: 136-154, 'Q 1 , 15 6- 165, 'Q', 167-176, 'Q', 178-194, 'N', 196-197, 'D',199- 
238, 'E' ,240, 'M 1 ,242-267, 1 DGEPE ',273-32 9 <RUT> 

A;Note: this sequence has the Glm(non-l) markers, 239-Glu and 241-Met 
R;Ponstingl, H. ; Hilschmann, N. 

Hoppe-Seyler ' s Z. Physiol. Chem. 357, 1571-1604, 1976 
A; Title: Die Primaerstruktur eines monoklonalen IgGl-Immunglobulins 
(Myelomprotein Nie) , III. Die chymotryptischen Peptide der H-Kette, Anordnung 
der tryptischen Peptide und Diskussion der volls taendigen Primaerstruktur. 
A; Reference number: A91668; MUID : 77070269; PMID: 826475 
A; Contents: myeloma protein Nie 
A;Accession: B91668 
A;Molecule type: protein 

A; Residues: 1-34, ' Q ' , 36-96, ' K ' , 98-115 , ' Q ' , 117-197 , 'D\ 199-238, 1 D ! , 240 , ' L ' , 242- 
2 68, 'E* ,27 0-271, 1 D 1 ,273-330 <P0N> 

A;Note: this sequence has the Glm(17) and Glm(l) markers 
R; Schmidt, W.E.; Jung, H.D.; Palm, W. ; Hilschmann, N. 
Hoppe-Seyler 's Z. Physiol. Chem. 364, 713-747, 1983 

A;Title: Die Primaerstruktur des kris tallisierbaren monoklonalen Immunglobulins 
IgGl KOL, I. 

A; Reference number: A91723; MUID : 83289131; PMID: 6884994 
A;Contents: myeloma protein KOL; disulfide bonds 
A;Accession: A91723 
A;Molecule type: protein 



A; Residues: 1-96, 'R', 98-197, 'D\ 199-238, 'E',240, 'M' ,242-266, 'D', 268-271, 'D',273- 
330 <SCH> 

A;Note: this sequence has the Glm(3) and Glm(non-l) markers 
R;Gall, W.E.; Edelman, G.M. 
Biochemistry 9, 3188-3196, 1970 

A; Title: The covalent structure of a human gammaG- immunoglobulin . X. Intrachain 
disulfide bonds. 

A; Reference number: A90565; MUID : 7 1064 027 ; PMID: 4923144 
A;Contents: annotation; disulfide bonds 
R;Dreker, L.; Schwarz, J.; Reichel, W. ; Hilschmann, N. 
Hoppe-Seyler 's Z. Physiol. Chem. 357, 1515-1540, 1976 

A; Title: Rule of antibody structure. The primary structure of monoclonal IgGl 
immunoglobulin (myeloma protein Nie) , I: purification and characterization of 
the protein, the L- and H-chains the cyanogenbromide cleavage products, and the 
disulfide bridges. 

A; Reference number: A91667; MUID : 77 070267 ; PMID: 1002129 

A;Contents: annotation; disulfide bonds 

C; Genetics : 

A; Gene: GDB : IGHG1 

A;Cross-references : GDB: 120085; OMIM: 147100 
A; Map position: 14q32 . 33-14q32 . 33 
A;Introns: 99/1; 114/1; 224/1 

C; Complex: An immunoglobulin heterotetramer subunit consists of two identical 

light (kappa or lambda) and two identical heavy (alpha, delta, epsilon, gamma, 

or mu) chains usually stabilized by interchain disulfide bonds. In some cases, 

such as IgA and IgM, the subunits associate into larger oligomers. 

C; Super family : immunoglobulin C region; immunoglobulin homology 

C;Keywords: duplication; glycoprotein; heterotetramer; immunoglobulin 

F; 2 0-85/Domain : immunoglobulin homology <IM1> 

F; 137-2 06/Domain : immunoglobulin homology <IM2> 

F; 243-310/Domain : immunoglobulin homology <IM3> 

F;27-83, 144-204, 250-308/Disulf ide bonds: ffstatus experimental 

F; 103/Disulf ide bonds: interchain (to light chain) ((status experimental 

F; 109, 112/Disulf ide bonds: interchain (to heavy chain) ((status experimental 

F; 180/Binding site: carbohydrate (Asn) (covalent) ((status experimental 

Query Match 75.7%; Score 1243; DB 1; Length 330; 

Best Local Similarity 92.1%; Pred. No. 8.7e-83; 

Matches 234; Conservative 1; Mismatches 11; Indels 8; Gaps 2; 

Qy 51 CSSNTPPLTCQRYCNASVTNSV — KGVDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMI 108 

I : I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I II I I I I I I 

Db 83 CNVNHKP SNTKVDKKVEPKSCDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMI 136 

Qy 109 SRTPEVTCVWDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDW 168 

I I I I I I I I I I I I I I M I I I I I I I M I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I II I I 
Db 137 SRTPEVTCWVDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDW 196 

Qy 169 LNGKEYKCKVSNK7\LPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFY 228 

I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I II I I II I I I 
Db 197 LNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFY 256 

Qy 229 PSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALH 288 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 257 PSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALH 316 



Qy 



289 NHYTQKSLSLSPGK 302 



Db 



317 NHYTQKSLSLSPGK 330 



RESULT 2 
S69339 



Ig heavy chain V region precursor - human 
C; Species: Homo sapiens (man) 

C;Date: 19-Mar-1997 #sequence_revision 19-Mar-1997 #text_change Ol-Dec-2000 
C;Accession: S69339; S72664 

R;Khamlichi, A. A. ; Aucouturier, P.; Preud 1 homme, J.L.; Cogne, M. 
Eur. J. Biochem. 229, 54-60, 1995 

A; Title: Structure of abnormal heavy chains in human heavy-chain-deposition 
disease. 

A; Reference number: S69339; MUID : 95262687 ; PMID:7744049 

A;Accession: S69339 

A; Status : preliminary 

A;Molecule type: mRNA 

A; Residues: 1-374 <KHA> 

A; Cross- references : EMBL:X81695 

R;Khamlichi, A. A. 

submitted to the EMBL Data Library, September 1994 

A; Reference number: S72664 

A; Accession: S72 664 

A; Status: preliminary 

A;Molecule type: mRNA 

A;Residues: 1-140, 1 C , 142-374 <KH2> 

A; Cross-references : EMBL:X81695 

C; Superf amily : immunoglobulin C region; immunoglobulin homology 

Query Match 75.3%; Score 1237.5; DB 2; Length 374; 

Best Local Similarity 79.0%; Pred. No. 2.5e-82; 

Matches 237; Conservative 9; Mismatches 23; Indels 31; Gaps 2 

Qy 23 TMLQMAGQCSQNEYFDSLLHACIPCQLRCSSNTPPLTCQRYCNAS 67 

II: hh | : : | | Ml 

Db 86 TRLTITKDTSKNQW LTMTNVDPADTATYYCGYSVEGYGQGYRFHSWGQ 134 

Qy 68 VTNSVKGVDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVWDVS 122 



Db 




Qy 



123 HEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKA 182 





Db 



195 HEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKA 254 



QY 



183 LPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQP 242 





Db 



2 55 LPAPIEKTISKAKGQPREPQVYTLPPSREEMTKNQVSLTCLVKGFYPSDIAVEWESNGQP 314 



Qy 



243 ENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 3 02 



Db 



315 ENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 37 4 




RESULT 3 
S31866 



Ig gamma-1 chain C region - synthetic 
C; Species: synthetic 

A;Note: Homo sapiens (man) gene engineered and expressed in Escherichia coli 
C;Date: 06-Jan-1995 #sequence_revision 17-Mar-1997 #text_change 19-May-2000 
C; Accession: S3 18 66 
R;Filpula, D. 

submitted to the EMBL Data Library, February 1993 

A; Description: Screeing method for protein-protein interactions of cloned gene 
products . 

A; Reference number: S31866 
A;Accession: S31866 
A;Molecule type: mRNA 
A; Residues: 1-255 <FIL> 

A; Cross-references : EMBL:X70421; NID:g33068; PIDN : CAA4 98 66 . 1 ; PID:g33069 
C; Keywords: immunoglobulin 

F; 1-22/Region : Escherichia coli outer membrane protein A precursor 
F; 23-255/Region : human Ig gamma-1 chain C region 



Query Match 75.3%; Score 1237; DB 4; Length 255; 

Best Local Similarity 99.1%; Pred. No. 1.7e-82; 

Matches 228; Conservative 0; Mismatches 2; Indels 0; Gaps 0; 



Qy 


73 


KGVDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVWDVSHEDPEVKFNW 


132 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 II 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 II 




Db 


26 


KSCDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCWVDVSHEDPEVKFNW 


85 


Qy 


133 


YVDGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTIS 


192 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


86 


YVDGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTIS 


145 


Qy 


193 


K7VKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPV 


252 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 | 1 1 1 1 1 1 1 




Db 


146 


KAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPV 


205 


Qy 


253 


LDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 3 02 








1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


206 


LDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 255 





RESULT 4 
PT0207 

Ig gamma chain C region - chimpanzee 
C; Species: Pan troglodytes (chimpanzee) 

C;Date: 23-Nov-1991 #sequence_revision 23-Nov-1991 #text_change 16-Jul-1999 
C; Accession: PT0207 

R;Ehrlich, P.H.; Moustafa, Z.A. ; Oestberg, L. 
Mol. Immunol. 28, 319-322, 1991 

A; Title: Nucleotide sequence of chimpanzee Fc and hinge regions. 
A;Reference number: PT0207; MUID : 9128 7716 ; PMID:2062315 
A; Access ion: PT02 07 
A;Molecule type: mRNA 
A; Residues: 1-234 <EHR> 

C; Superf amily : immunoglobulin C region; immunoglobulin homology 
C;Keywords: immunoglobulin 

F; 48-117/Domain : immunoglobulin homology <IMM> 



Query Match 



72.2%; Score 1186; DB 2; Length 234; 



Best Local Similarity 94.8%; Pred. No. 7.8e-79; 

Matches 221; Conservative 1; Mismatches 9; Indels 2; Gaps 1; 



65 NASVTNSV— KGVDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMI SRTPEVTCVWDVS 122 



Db 



2 NTKVDKKVEPKSCDTTHTCPPCAAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVWDVS 61 



Qy 



123 HEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKA 182 




Db 



62 HEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKA 121 



QY 



183 LPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQP 2 42 



Db 



122 LPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESSGQP 181 



Qy 



243 ENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKS 295 



Db 



182 ENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKS 234 



RESULT 5 
A23511 

Ig gamma-3 chain C region (allotype G3m(b) ) - human 
C; Species: Homo sapiens (man) 

C;Date: 28-Dec-1987 #sequence_revision 28-Dec-1987 #text_change 23-Jul-1999 
C; Access ion: A23511 

R;Huck, S.; Fort, P.; Crawford, D.H.; Lefranc, M.P.; Lefranc, G. 
Nucleic Acids Res. 14, 1779-1789, 1986 

A; Title: Sequence of a human immunoglobulin gamma 3 heavy chain constant region 

gene: comparison with the other human C-gamma genes. 

A;Reference number: A23511; MUID : 8 614 8507 ; PMID:3081877 

A; Accession : A23511 

A;Molecule type: DNA 

A; Residues: 1-377 <HUC> 

A;Cross-references: GB:X03604; GB:M12958; NID:g33070; PIDN : CAA27268 . 1 ; 

PID:g577056 

C; Genetics : 

A; Gene: GDB : IGHG3 

A;Cross-references : GDB: 119339; OMIM: 147120 
A; Map position: 14q32 . 33-14q32 . 33 

A;Introns: 98/3; 115/3; 130/3; 145/3; 160/3; 270/3 

C; Superf amily : immunoglobulin C region; immunoglobulin homology 

C; Keywords: immunoglobulin 

F; 20-85/Domain : immunoglobulin homology <IMM> 

Query Match 70.8%; Score 1163; DB 2; Length 377; 

Best Local Similarity 84.7%; Pred. No. 6.4e-77; 

Matches 221; Conservative 9; Mismatches 19; Indels 12; Gaps 4; 

Qy 46 PCQLRC SSNTPPLTCQRYCNASVTNSVKGVDKTHTCPPCPAPELLGGPSVFLFPPK 101 

II II I : I I I II I I I I I I I I I I I I I I I I I M I I I 

Db 125 PCP-RCPEPKSCDTPP-PCPR CPEPKSCDTPPPCPRCPAPELLGGPSVFLFPPK 176 

Qy 102 PKDTLMISRTPEVTCVWDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRWSVL 161 



Db 



177 PKDTLMISRTPEVTCVWDVSHEDPEVQFKWYVDGVEVHNAKTKPREEQYNSTFRWSVL 236 




Qy 162 TVLHQDWLNGKEYKCKVSNKALPAPIEKT1SKAKGQPREPQVYTLPPSRDELTKNQVSLT 221 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I II I I II I I : I : I I I I I I I I 
Db 2 37 TVLHQDWLNGKEYKCKVSNKALPAP I EKT I SKTKGQPREPQVYTLPP SREEMTKNQVS LT 296 

Qy 222 CLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCS 281 

I I I I I I I I 1 I I I I I I I I : I I I I II I I I 1 I : I I I I I I I I I I I I I II I II I I II I I : I I I I 
Db 297 CLVKGFYPSDIAVEWESSGQPENNYNTTPPMLDSDGSFFLYSKLTVDKSRWQQGNIFSCS 356 

Qy 282 VMHEALHNHYTQKSLSLSPGK 302 

I I I I I I I I : I I I I I I I I I I I 
Db 357 VMHEALHNRFTQKSLSLSPGK 377 



RESULT 6 
A60764 

Ig gamma-3 chain C region, form LAT - human 
C; Species: Homo sapiens (man) 

C;Date: 14~May-1993 #sequence_revision 14-May-1993 #text_change 16-Jul-1999 
C;Accession: A60764 

R;Huck, S.; Lefranc, G. ; Lefranc, M.P. 
Immunogenetics 30, 250-257, 1989 

A;Title: A human immunoglobulin IGHG3 allele (GmbO, bl, c3, c5, u) with an IGHG4 
converted region and three hinge exons . 

A; Reference number: A60764; MUID : 90007 613 ; PMID : 2571587 
A;Accession: A60764 
A; Status: preliminary 
A;Molecule type: DNA 
A; Residues: 1-377 <HUC> 

C; Super family : immunoglobulin C region; immunoglobulin homology 
C; Keywords: immunoglobulin 

F; 2 0-85/Domain : immunoglobulin homology <IMM> 



Query Match 70.7%; 
Best Local Similarity 84.7%; 
Matches 221; Conservative 



Score 1161; DB 2; 
Pred. No. 8.9e-77; 
9; Mismatches 19; 



Length 377; 
Indels 12; 



Gaps 



4; 



Qy 



Db 



46 PCQLRC SSNTPPLTCQRYCNASVTNSVKGVDKTHTCPPCPAPELLGGPSVFLFPPK 101 

I I M I : I I I II I I I I I I I I I I I I I I I I I I I I I I 

125 PCP-RCPEPKSCDTPP-PCPR CPEPKSCDTPPPCPRCPAPELLGGPSVFLFPPK 176 



Qy 

Db 

Qy 

Db 

Qy 

Db 



102 P KDT LMI S RT P E VT C VWDVS HE D P E VK FNW YVD GVE VHNAKT K P RE EQ YN S T YRWSVL 161 

I I I I I I I I I I I I II I I I I I I I I I I I I I : I I I I I I I I I I II I I I I I I I I I I I I : I I I I I I 
177 PKDTLMISRTPEVTCVWDVSHEDPEVQFKWYVDGVEVHNAKTKPREEQYNSTFRWSVL 236 

162 TVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT 221 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I 1 I I I I I I I II : I : I I I I I I I I 
237 TVLHQDWLNGKEYKCKVSNKALPAPIEKTISKTKGQPREPQVYTLPPSREEMTKNQVSLT 296 



222 



281 



CLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCS 
I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I 1 I I I I I I I : I 1 I I I I I I I : I I I I I I I 
297 CLVKGFYPSDIAVEWESSGQPENNYNTTPPVLDSDGSFFLYSRLTVDKSRWQEGNVFSCS 356 



Qy 282 VMHEALHNHYTQKSLSLSPGK 302 

I I I I I II I : I I I I I I I I I I I 
Db 357 VMHEALHNRFTQKSLSLSPGK 377 



RESULT 7 
G2HU 

Ig gamma- 2 chain C region - human 
C; Species: Homo sapiens (man) 

C;Date: 30-Apr-1981 #sequence_revision 13-Jun-1983 #text_change 21-Jul-2000 
C;Accession: A93906; A92809; A90752; A93132; A02148 
R;Ellison, J.; Hood, L. 

Proc. Natl. Acad. Sci. U.S.A. 79, 1984-1988, 1982 

A; Title: Linkage and sequence homology of two human immunoglobulin gamma heavy 
chain constant region genes. 

A/Reference number: A93906; MUID : 82197621 ; PMID:6804948 
A; Accession: A93906 
A;Molecule type : DNA 
A; Residues: 1-326 <ELL> 

A; Cross-references : GB:V00554; GB:J00230; NID:g32759; PIDN : CAB58438 . 1 ; 
PID:g6066056 

A;Note: Lys-326 is probably removed posttranslationally 
R;Wang, A.C.; Tung, E . ; Eudenberg, H.H. 
J. Immunol. 125, 1048-1054, 1980 

A; Title: The primary structure of a human IgG2 heavy chain: genetic, 

evolutionary, and functional implications. 

A;Reference number: A92809; MUID : 8 1007873 ; PMID:6774012 

A; Contents: myeloma protein Til 

A;Accession: A92809 

A;Molecule type: protein 

A; Residues: 1-19, ! Q', 21-57, ' Z',59, 1 A 1 , 61-193 , 'D*, 195-325 <WAN> 
A; Note: Trp-156 is at or near the complement-binding site 
R;Connell, G.E.; Parr, D.M.; Hofmann, T. 
Can. J. Biochem. 57, 758-767, 1979 

A; Title: The amino acid sequences of the three heavy chain constant region 

domains of a human IgG2 myeloma protein. 

A; Reference number: A90752; MUID : 8 0001357 ; PMID: 113060 

A; Contents: myeloma protein Zie 

A;Accession: A90752 

A;Molecule type: protein 

A; Residues: 1-2 4, ' E', 26-57, ' EV , 60-8 5 ; 132-17 1 , ' ZZZ',175, 'B', 17 7-193, 'D',195- 

196, T Q 1 , 198 — 234 , ' Z' ,236-2 63, ' BGEPZ 1 , 2 69-325 <CON> 

A; Note: this sequence has since been revised 

R;Hofmann, T.; Parr, D.M. 

Mol. Immunol. 16, 923-925, 1979 

A; Title: A note on the amino acid sequence of residues 381-391 of human 
immunoglobulin gamma chains. 

A;Reference number: A93132; MUID : 80114419 ; PMID:118920 

A; Contents: Zie 

A; Accession : A93132 

A;Molecule type: protein 

A;Residues: 238-275 <H0F> 

R;Hofmann, T.; Parr, D.M. 

submitted to the Atlas, March 1980 

A; Reference number: A94591 

A;Contents: annotation; Zie, revisions to residues 25, 59, 60, and 264-268 
A;Note: the revised sequence differs from that shown in having 60-Ala and in the 
amidation states of residues 58, 194, and 197; the amidation states of residues 
172-174, 176, and 235 were not determined 
R;Milstein, C; Frangione, B. 
Biochem. J. 121, 217-225, 1971 

A; Title: Disulphide bridges of the heavy chain of human immunoglobulin G2 . 



A;Reference number: A90253; MUID : 72 033500; PMID:4940472 
A; Contents: annotation; myeloma protein Sa, disulfide bonds 
R;Frangione, B . ; Milstein, C; Pink, J.R.L. 
Nature 221, 145-148, 1969 

A; Title: Structural studies of immunoglobulin G . 

A; Reference number: A93157; MUID : 69064 124 ; PMID: 5782707 

A; Contents: annotation; Sa, disulfide bonds 

C; Genetics : 

A; Gene: GDB : IGHG2 

A;Cross-references: GDB:119338; OMIM:147110 
A; Map position: 14q32 . 33-14q32 . 33 

C; Complex: An immunoglobulin heterotetramer subunit consists of two identical 

light (kappa or lambda) and two identical heavy (alpha, delta, epsilon, gamma, 

or mu) chains usually stabilized by interchain disulfide bonds. In some cases, 

such as IgA and IgM, the subunits associate into larger oligomers. 

C; Superf amily : immunoglobulin C region; immunoglobulin homology 

C; Keywords: duplication; glycoprotein; heterotetramer; immunoglobulin 

F;20-85/Domain: immunoglobulin homology <IM1> 

F; 133-2 02 /Domain : immunoglobulin homology <IM2> 

F;239-306/Domain: immunoglobulin homology <IM3> 

F; 14/Disulf ide bonds: interchain (to light chain) #status experimental 

F;27-83, 140-200, 246-304/Disulfide bonds: #status experimental 

F; 102, 103, 106, 109/Disulf ide bonds: interchain (to heavy chain) ((status 

experimental 

F; 17 6/ Binding site: carbohydrate (Asn) (covalent) #status predicted 

Query Match 70.2%; Score 1153; DB 1; Length 326; 

Best Local Similarity 87.6%; Pred. No. 2.8e-76; 

Matches 218; Conservative 8; Mismatches 15; Indels 8; Gaps 3 

Qy 61 QRY-CNASVTNSVKGVDKT HTCPPCPAPELLGGPSVFLFPPKPKDTLMI SRTPE 113 

I I I I I I I I I I I I I II I : I I I I I I I I I I I I I I I I I I I I I I 

Db 79 QTYTCNVDHKPSNTKVDKTVERKCCVECPPCPAPP-VAGPSVFLFPPKPKDTLMI SRTPE 137 

Qy 114 VTCWVDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKE 173 

I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I : I I I : I I I I I I I I : M I I I I I I I 
Db 138 VTCVWDVSHEDPEVQFNWWDGV^VTINAKTKPREEQFNSTFRWSVLTVVHQDWLNGKE 197 

Qy 174 YKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIA 233 

I I I I I I M I I I I I I I I I II I I I I I I I I I M I I I I I : I : I I I I I I I I I I I I I | I I I I | | 
Db 198 YKCKVSNKGLPAPIEKTISKTKGQPREPQVYTLPPSREEMTKNQVSLTCLVKGFYPSDIA 257 

Qy 234 VEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQ 293 

I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 258 VEWESNGQPENNYKTTPPMLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQ 317 

Qy 294 KSLSLSPGK 302 

I I I I I I II I 
Db 318 KSLSLSPGK 326 



RESULT 8 
G4HU 

Ig gamma-4 chain C region - human 
C; Species: Homo sapiens (man) 

C;Date: 02-Apr-1982 #sequence_revision 02-Apr-1982 #text_change 16-Jul-1999 
C;Accession: A90933; A90249; A02150 



R;Ellison, J.; Buxbaum, J.; Hood, L. 
DNA 1, 11-18, 1981 

A; Title: Nucleotide sequence of a human immunoglobulin C-gamma4 gene. 
A; Reference number: A90933; MUID : 83157104 ; PMID: 6299662 
A; Accession: A90933 
A; Molecule type: DNA 
A; Residues: 1-327 <ELL> 

A; Note: the sequence was determined from the germline gene 
R;Pink, J.R.L.; Buttery, S.H.; DeVries, G.M. ; Milstein, C. 
Biochem. J. 117, 33-47, 1970 

A; Title: Human immunoglobulin sublclasses. Partial amino acid sequence of the 
constant region of A gamma4 chain. 

A;Reference number: A90249; MUID : 702 07560 ; PMID:4192699 

A; Accession: A902 4 9 

A;Molecule type: protein 

A; Residues: 1-30; 81-326 <PIN> 

C; Genetics : 

A; Gene: GDB : IGHG4 

A; Cross-references: GDB: 119340; OMIM: 147130 
A; Map position: 14q32 . 33-14q32 . 33 
A;Introns: 99/1; 111/1; 221/1 

C; Complex: An immunoglobulin heterotetramer subunit consists of two identical 

light (kappa or lambda) and two identical heavy (alpha, delta, epsilon, gamma, 

or mu) chains usually stabilized by interchain disulfide bonds. In some cases, 

such as IgA and IgM, the subunits associate into larger oligomers. 

C; Superf amily : immunoglobulin C region; immunoglobulin homology 

C; Keywords: duplication; glycoprotein; heterotetramer; immunoglobulin 

F; 2 0-85/Domain : immunoglobulin homology <IM1> 

F; 99-110/Region: hinge 

F; 134-203/Domain : immunoglobulin homology <IM2> 
F; 24 0-307/Domain : immunoglobulin homology <IM3> 

F; 14/Disulf ide bonds: interchain (to light chain) #status experimental 
F;27-83, 141-201, 247-305/Disulfide bonds: #status predicted 

F; 106, 109/Disulf ide bonds: interchain (to heavy chain) #status experimental 
F; 177/Binding site: carbohydrate (Asn) (covalent) #status predicted 

Query Match 69.5%; Score 1142; DB 1; Length 327; 

Best Local Similarity 85.4%; Pred. No. 1.8e-75; 

Matches 216; Conservative 10; Mismatches 25; Indels 2; Gaps 1 

Qy 52 SSNTPPLTCQRYCNASVTNSVKGVDKTH — TCPPCPAPELLGGPSVFLFPPKPKDTLMIS 109 

I I II I I II: : II I I I I I I I I I I I I I I I I I I I I I I I II 

Db 75 SLGTKTYTCNVDHKPSNTKVDKRVESKYGPPCPSCPAPEFLGGPSVFLFPPKPKDTLMIS 134 

Qy 110 RTPEVTCVWDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWL 169 

I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I : I I | I M I II I I I I I I I I I 
Db 135 RTPEVTCVWDVSQEDPEVQFNWYVDGVEVHNAKTKPREEQFNSTYRWSVLTVLHQDWL 194 

Qy 170 NGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYP 229 

I I I I I I I I I I I I II: I I I II I I II I I I I I I I I I I I I I I :: I : I I I I I I I I I I I I I I I I 
Db 195 NGKEYKCKVSNKGLPSSIEKTISKAKGQPREPQVYTLPPSQEEMTKNQVSLTCLVKGFYP 254 

Qy 230 SDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCS VMHEALHN 289 

I I I I I M I I I M M I I I I I I I I I I I I I I 1 I I I I I : I I I I I I I I I : I II I I I I I I I I I I I I 
Db 255 SDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSRLTVDKSRWQEGNVFSCSVMHEALHN 314 



Qy 



290 HYTQKSLSLSPGK 302 



Db 315 HYTQKSLSLSLGK 327 



RESULT 9 
G3HUWI 

Ig gamma-3 heavy chain disease proteins - human 
C; Species: Homo sapiens (man) 

C;Date: 31-Dec-1979 #sequence_revision 23-Oct-1981 #text_change 16-Jul-1999 

C;Accession: A90442; A92219; A90198; A93915; A02149 

R; Frangione, B.; Rosenwasser, E.; Prelli, F. ; Franklin, E.C. 

Biochemistry 19, 4304-4308, 1980 

A; Title: Primary structure of human gamma3 immunoglobulin deletion mutant: 
gamma3 heavy-chain disease protein Wis. 

A; Reference number: A90442; MUID: 81021548 ; PMID: 6774747 
A; Contents: heavy chain disease protein Wis 
A; Accession: A904 42 
A;Molecule type: protein 
A; Residues: 1-289 <FRA> 

A;Note: the molecule is a dimer linked by 12 disulfide bonds; it has an extra 
interchain disulfide bond at position 7 in addition to the 11 normally present 
in the hinge region 

A;Note: this protein lacks most of the V region and all of the CHI region. 
Residue 12 corresponds to the beginning of the hinge region of normal gamma-3 
chains 

A;Note: the sequence of residues 42-76 was taken from the reference that follows 
R; Michael sen, T.E.; Frangione, B. ; Franklin, E.C. 
J. Biol. Chem. 252, 883-889, 1977 

A; Title: Primary structure of the 1 hinge ' region of human IgG3. Probable 
quadruplication of a 15-amino acid residue basic unit. 
A; Reference number: A92219; MUID: 77118561; PMID: 402363 

A; Contents: normal gamma-3 chains, sequence corresponding to residues 12-97 of 
protein Wis 
A;Accession: A92219 
A;Molecule type: protein 
A; Residues: 12-97 <MIC> 

A;Note: the hinge region in gamma-3 chains is about four times as long as in 
other gamma chains and contains three identical 15-residue segments (29-43, 44- 
58, and 59-73) preceded by a similar 17-residue segment (12-28) 

A;Note: cysteines at positions 24, 27, 33, 39, 42, 48, 54, 57, 63, 69, and 72 
form inter-heavy chain bonds 

R;Wolf enstein-Todel, C; Frangione, B.; Prelli, F. ; Franklin, E.C. 
Biochem. Biophys . Res. Commun. 71, 907-914, 1976 

A; Title: The amino acid sequence of "heavy chain disease" protein ZUC. Structure 

of the Fc fragment of immunoglobulin G3 . 

A; Reference number: A90198; MUID: 77021516; PMID: 823945 

A; Contents: heavy chain disease protein Zuc, partial sequence corresponding to 

residues 59-289 of protein Wis 

A; Accession: A9 019 8 

A; Molecule type : protein 

A;Residues: 59-125 ,' EB 128-226, 228-2 89 <WOL> 

A;Note: this protein lacks most of the V region, all of the CHI region, and part 
of the hinge compared with normal gamma-3 heavy chains 

R;Alexander, A.; Steinmetz, M. ; Barritault, D.; Frangione, B.; Franklin, E.C; 
Hood, L.; Buxbaum, J.N. 

Proc. Natl. Acad. Sci. U.S.A. 79, 3260-3264, 1982 



A; Title: gamma heavy chain disease in man: cDNA sequence supports partial gene 
deletion model. 

A; Reference number: A93915; MUID : 8224 7 835 ; PMID: 6808505 
A; Contents: heavy chain disease protein Omm 
A;Accession: A93915 
A;Molecule type: mRNA 

A; Residues: 12-70; 72-114 ; 116-125, 'E\ 127-133, 1 L 1 , 135-136, ' E * , 138 , ' Y ',14 0- 

154, 'D', 156-157, ' N 1 , 159-181, ' A' , 183-186; 199-226, 'N 1 , 228-278, ' Y 1 , 280-289 <ALE> 

A;Note: a carboxyl-terminal Lys is removed pos ttranslationally 

A;Note: this sequence may represent an allelic form or another gamma chain 

subclass 

C; Comment: The heavy chain disease protein Wis is shown. 

C; Genetics : 

A; Gene: GDB : IGHG3 

A;Cross-references : GDB: 119339; OMIM: 147120 
A; Map position: 14q32 . 33-14q32 . 33 

C; Super family : immunoglobulin C region; immunoglobulin homology 

C; Keywords: duplication; glycoprotein; immunoglobulin; pyroglutamic acid 

F;2 03-27 0/Domain: immunoglobulin homology <IMM> 

F; 1/Modif ied site: pyrrolidone carboxylic acid (Gin) #status experimental 
F; 6, 140/Binding site: carbohydrate (Asn) (covalent) #status experimental 

Query Match 69.3%; Score 1138.5; DB 1; Length 289; 

Best Local Similarity 75.5%; Pred. No. 2.8e-75; 

Matches 219; Conservative 19; Mismatches 37; Indels 15; Gaps 4; 

Qy 26 QMAG-QCSQNEYFDSLL HACIPCQLRCSSNTPPLTCQRYCNASVTNS 71 

Mil:: : | | I : I I I I I : : 

Db 1 QMQGVNCTVSSELKTPLGDTTHTCPRCPEPKSCDTPP-PCPRCPEPKSCDTPPPCPRCPE 59 

Qy 72 VKGVDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVWDVSHEDPEVKFN 131 

I I M I I I I I I I I II I I I I I I I I I II II II II M I I I I I I I I I I I I I I I I : I 

Db 60 PKSCDTPPPCPRCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVWDVSHEDPEVQFK 119 

Qy 132 WYVDGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTI 191 

I I I I I I : I I I I I I I I I I : I : I I I : I I I I I I I I I I I : I I : I I I I I I I I I I I I I I I I I I I II 

Db 12 0 WYVDGVQVHNAKTKPREQQFNSTFRWSVLTVLHQNWLDGKEYKCKVSNKALPAPIEKTI 17 9 

Qy 192 SKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPP 251 

II I M I I I I I I I I I I I I I : I : I II I I I I I I I I II I I I I I I I I 1 I I I : I I I I I I I MM 

Db 180 SKTKGQPREPQVYTLPPSREEMTKNQVSLTCLVKGFYPSDIAVEWESSGQPENNYNTTPP 239 

Qy 252 VLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPG 301 

: I M I II I I I I I I I M I I II I I I I I : I I I I I I I II I I I : I I II I II II I 
Db 240 MLDSDGSFFLYSKLTVDKSRWQQGNIFSCSVMHEALHNRFTQKSLSLSPG 289 



RESULT 10 
GHRB 

Ig gamma chain C region - rabbit 

C; Species: Oryctolagus cuniculus (domestic rabbit) 

C;Date: 24-Apr-1984 #sequence_revision 15-Nov-1984 #text_change 16-Jul-1999 
C;Accession: A91749; A90290; A93928; A90245; A94416; A02161 
R;Bernstein, K.E.; Alexander, C.B.; Mage, R.G. 
Immunogenetics 18, 387-397, 1983 

A;Title: Nucleotide sequence of a rabbit IgG heavy chain from the recombinant F 
I haplotype. 



A; Reference number: A91749; MUID : 84 030930 ; PMID: 6313520 
A; Accession: A9174 9 
A;Molecule type: mRNA 
A; Residues: 1-323 <BER> 

A;Note: this sequence has the dl2 allotypic marker, 104-Thr, and the el4 marker, 
185-Thr 

R; Pratt, D.M. ; Mole, L.E. 
Biochem. J. 151, 337-349, 1975 

A; Title: Sequence studies on the constant region of the Fd sections of rabbit 
immunoglobulin G of different allotype. 

A; Reference number: A90290; MUID : 761354 69 ; PMID: 1243651 
A; Accession: A902 90 
A;Molecule type : protein 

A;Residues: 1-47, 'E', 49-71, 1 P V ' , 72-128 <PRA> 

R; Martens, C.L.; Moore, K.W. ; Steinmetz, M. ; Hood, L . ; Knight, K.L. 
Proc. Natl. Acad. Sci. U.S.A. 79, 6018-6022, 1982 

A; Title: Heavy chain genes of rabbit IgG; isolation of a cDNA encoding gamma 
heavy chain and identification of two genomic C-gamma genes. 
A; Reference number: A93928; MUID : 83299917 ; PMID: 6193512 
A; Accession: A93 92 8 
A;Molecule type: mRNA 

A; Residues: 8 8-103, 'M' , 105-143, ' E', 145-184, 'A', 18 6, »E», 18 8-2 66 <MAR> 
A; Cross-references : GB:M16426; NID:gl65111; PIDN: AAA312 89 . 1 ; PID:gl65112 
A; Note: this sequence has the dll allotypic marker, 104-Met, and the el5 
allotypic marker, 185-Ala 

R;Fruchter, R.G.; Jackson, S.A.; Mole, L.E.; Porter, R.R. 
Biochem. J. 116, 249-259, 1970 

A;Title: Sequence studies of the Fd section of the heavy chain of rabbit 
immunoglobulin G . 

A;Reference number: A90245; MUID : 70110015 ; PMID:5461106 

A; Accession: A9024 5 

A;Molecule type: protein 

A; Residues: 132-143 , 1 E 1 , 145-161 <FRU> 

R;Hill, R.L.; Lebovitz, H.E.; Fellows Jr., R.E.; Delaney, R. 

in Gamma Globulins, Nobel Symp. 3, Killander, J., ed., pp. 109-127, Almqvist and 
Wiksell, Stockholm, 1967 
A; Reference number: A94416 
A; Accession: A94416 
A;Molecule type: protein 

A; Residues: 12 9-131 ; 155-172 , 'D ',174-184, 'A 1 , 186, 'E', 188-200, 'D', 202-2 17, 'E',219- 
232, 'Q* , 234-245, 'D', 247-255, 'G', 257-259, 'D', 2 61-265, 'D', 2 67-27 9, 'W',2 81- 
283, 'S 1 ,285-322 <HIL> 

A;Note: this has the el5 allotypic marker, 185-Ala 

C; Complex: An immunoglobulin heterotetramer subunit consists of two identical 

light (kappa or lambda) and two identical heavy (alpha, delta, epsilon, gamma, 

or mu) chains usually stabilized by interchain disulfide bonds. In some cases, 

such as IgA and IgM, the subunits associate into larger oligomers. 

C; Superf amily : immunoglobulin C region; immunoglobulin homology 

C; Keywords: duplication; glycoprotein; heterotetramer; immunoglobulin 

F; 2 0-8 2 /Domain : immunoglobulin homology <IM1> 

F; 130-199/Domain : immunoglobulin homology <IM2> 

F; 2 36-3 03 /Domain : immunoglobulin homology <IM3> 

F;173/Binding site: carbohydrate (Asn) (covalent) #status predicted 

Query Match 56.9%; Score 934.5; DB 1; Length 323; 

Best Local Similarity 68.0%; Pred. No. 1.9e-60; 

Matches 174; Conservative 33; Mismatches 40; Indels 9; Gaps 3; 



Qy 52 SSNTPPLTCQRYCNASVTNSVKGVDKT HTC — PPCPAPELLGGPSVFLFPPKPKDTL 106 

: I : : I : I I I : I I I II II I II I I I I I I II I I : I I I I I I I II 

Db 72 TSSSQPVTCNVAHPATNTK VDKTVAPSTCSKPTCPPPELLGGPSVFIFPPKPKDTL 127 

Qy 107 MISRTPEVTCVWDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRWSVLTVLHQ 166 

I I I I II I I I I I I I I I I : I I I I : I I I : : : I I : I I : I : I I I I I I I I : I I 
Db 128 MI S RT P E VT CVWDVS Q DD P E VQ FTWY I NN EQ VRTARP P L REQQ FN S T I RWS T L P I THQ 187 

Qy 167 DWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKG 22 6 

III llhllll I I I I II I I I I I I I I I : I I I I I : I I I : I I : I I : Mill:: I 
Db 188 DWLRGKEFKCKVHNKALPAPIEKTISKARGQPLEPKVYTMGPPREELSSRSVSLTCMING 247 

Qy 227 FYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEA 286 

lllllhllll II: I : I I I I I I I I I I I I I : 1 I I : I I : I I I I : I : I I : I I I I I I I 
Db 248 FYPSDISVEWEKNGKAEDNYKTTPAVLDSDGSYFLYNKLSVPTSEWQRGDVFTCSVMHEA 307 

Qy 287 LHNHYTQKSLSLSPGK 302 

I I I I I I I I I : I I I I I 
Db 308 LHNHYTQKSISRSPGK 323 



RESULT 11 
147159 

Ig gamma 2a chain constant region - pig (fragment) 
C; Species: Sus scrofa domestica (domestic pig) 

C;Date: 21-Feb-1997 #sequence_revision 21-Feb-1997 #text_change 21-Jan-2000 
C; Access ion: 147159 

R; Kacskovics , I.; Sun, J.; Butler, J.E. 
J. Immunol. 153, 3565-3573, 1994 

A; Title: Five putative subclasses of swine IgG identified from the cDNA 
sequences of a single animal. 

A;Reference number: 147158; MUID : 95015845 ; PMID: 7930579 
A; Accession: 14715 9 

A; Status: preliminary; translated from GB/EMBL/DDB J 
A; Molecule type: mRNA 
A; Residues: 1-328 <KAC> 

A; Cross-references : EMBL : U0377 9 ; NID:g433123; PIDN : AAA522 17 .1; PID: g433124 
C; Genetics : 
A; Gene: IgG2a 

C; Superf amily : immunoglobulin C region; immunoglobulin homology 
F; 133-202/Domain : immunoglobulin homology <IMM> 

Query Match 55.9%; Score 918.5; DB 2; Length 328; 

Best Local Similarity 61.5%; Pred. No. 2.8e-59; 

Matches 184; Conservative 39; Mismatches 51; Indels 25; Gaps 9; 

Qy 22 VTMLQMAGQCSQNEY-FDSLLHACIPCQLRCSSN — TPP LTCQRY-CNASVTNSVKG 7 4 

I I : : I I : I I : I II I : I I I : : I I I : : 

Db 37 VTVTWN S GAL S S GVHT FP S VLQ P S GL YS L S SMVTVPAS S L S S KS YTCNVNHPATTT K 93 

Qy 75 VDK THTCPPCP APELLGGPSVFLFPPKPKDTLMISRTPEVTCVWDVSHED 125 

Ml I I I I I I : I I I I I I : I I I I I I I I I I I I I I I : I I I I I I I I I I : 

Db 94 VDKRVGTKTKPPCPICPACESP GPSVFIFPPKPKDTLMI SRTPQVT CVWDVS Q EN 149 



Qy 



12 6 PEVKFNWYVDGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPA 185 
I I I : I : I I I I I I I II I : I : I : I I I : I I I I I I I I I I : I I I I I I I I I : I I I I : I I III 



Db 



150 PEVQFSWYVDGVEVHTAQTRPKEEQFNSTYRWSVLPIQHQDWLNGKEFKCKVNNKDLPA 209 



Qy 186 PIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQ--PE 243 

M : I M I I I I I I I I I I I I I I : I I : : : : I I : I I I | | | | 1 M I I I : I M II 
Db 210 PITRIISKAKGQTREPQVYTLPPHAEELSRSKVSITCLVIGFYPPDIDVEWQRNGQPEPE 269 

Qy 2 44 NNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 3 02 

IIHIM I I I - I I I I I : I II : II I : I I : I I I I I M I I I I I I I : I : I I I 
Db 270 GNYRTTPPQQDVDGTYFLYSKFSVDKASWQGGGIFQCAVMHEALHNHYTQKSISKTPGK 328 



RESULT 12 
147160 

Ig gamma 2b chain constant region - pig (fragment) 
C; Species: Sus scrofa domestica (domestic pig) 

C;Date: 21-Feb-1997 #sequence_revision 21-Feb-1997 #text_change 21-Jan-2000 
C; Accession: 14 7160 

R; Kacskovics, I.; Sun, J.; Butler, J.E. 
J. Immunol. 153, 3565-3573, 1994 

A; Title: Five putative subclasses of swine IgG identified from the cDNA 
sequences of a single animal. 

A; Reference number: 147158; MUID : 95015845 ; PMID:7930579 
A; Accession: 14 7160 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A;Molecule type: mRNA 
A; Residues: 1-328 <KAC> 

A; Cross-references: EMBL : U03780; NID:g433125; PIDN : AAA52218 . 1 ; PID:g433126 
C; Genetics : 
A; Gene: IgG2b 

C; Superf amily : immunoglobulin C region; immunoglobulin homology 
F; 133-2 02/Domain : immunoglobulin homology <IMM> 

Query Match 55.8%; Score 917; DB 2; Length 328; 

Best Local Similarity 66.9%; Pred. No. 3.6e-59; 

Matches 172; Conservative 34; Mismatches 35; Indels 16; Gaps 5; 

Qy 58 LTCQRY-CNASVTNSVKGVDK THTCPPCP APELLGGPSVFLFPPKPKDTLM 107 

I : = I M : : I I I I I I I II : I I I I I I : I II I I I I I I I 

Db 7 6 LSSKSYTCNVNHPATTTKVDKRVGTKTKPPCPICPACESP GPSVFIFPPKPKDTLM 131 

Qy 108 ISRTPEVTCWVDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRWSVLTVLHQD 167 

11111:111111111 I : I I I : I : I I I I I I I II I : I : I : I II : I I I I I I I I I I : III 
Db 132 ISRTPQVTCVWDVSQENPEVQFSWYVDGVEVHTAQTRPKEEQFNSTYRWSVLPIQHQD 191 

Qy 168 WLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGF 227 

I I I M I : I I I I : I I I I I I I : I I I I I I I II I II I I I I I : I I : : : : I I : I I II II 
Db 192 WLNGKEFKCKVNNKDLPAPITRIISKAKGQTREPQVYTLPPHAEELSRSKVSITCLVIGF 251 

Qy 228 YPSDIAVEWESNGQ — PENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHE 285 

I I I I I I I : I I I lllhllll I II : : I I I I I : I M : I I I : I I : I I I I 
Db 252 YPPDIDVEWQRNGQPEPEGNYRTTPPQQDVDGTYFLYSKFSVDKASWQGGGIFQCAVMHE 311 

Qy 286 ALHNHYTQKSLSLSPGK 302 

I I I I I I I I I I : I : I I 1 
Db 312 ALHNHYTQKS I SKTPGK 328 



RESULT 13 
147162 

Ig gamma 4 chain constant region - pig (fragment) 
C; Species: Sus scrofa domestica (domestic pig) 

C;Date: 21-Feb-1997 #sequence_revision 21-Feb-1997 #text_change 21-Jan-2000 
C; Accession: 147162 

R;Kacskovics, I.; Sun, J.; Butler, J.E. 
J. Immunol. 153, 3565-3573, 1994 

A;Title: Five putative subclasses of swine IgG identified from the cDNA 
sequences of a single animal. 

A;Reference number: 147158; MUID : 95015845 ; PMID:7930579 
A; Accession: 147162 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A;Molecule type: mRNA 
A; Residues: 1-277 <KAC> 

A;Cross-references : EMBL:U03782; NID:g433129; PIDN :AAA52220 . 1 ; PID:g433130 
C; Genetics : 
A; Gene: IgG4 

C;Superfamily: immunoglobulin C region; immunoglobulin homology 
F; 82-151/Domain: immunoglobulin homology <IMM> 

Query Match 55.7%; Score 915; DB 2; Length 277; 

Best Local Similarity 68.0%; Pred. No. 4.1e-59; 

Matches 172; Conservative 33; Mismatches 40; Indels 8; Gaps 

T CQRY-CNASVTNSVKGVDK THTCPPCP-APELLG-GPSVFLFPPKPKDTLMI SRT 1 

: IM- : I M I I I I I I I I I I I I : I I I I I I I I I I I I I I 



I : I I M I II I I I : I II : I : I I I I I I I I I I : I : I : I I I : I I I I I I I I | | : MINI 



MHIIhll I I I I I : I I II I II I I I I I II I I I : I I : : : : | : | | | | | I ! , I 



1111:111 I I I I : I I I I I I I :: I I I I I I I I I : M : I : I I : | I | | | | | | 



Qy 


58 


Db 


25 


Qy 


112 


Db 


85 


Qy 


172 


Db 


145 


Qy 


232 


Db 


205 


Qy 


290 


Db 


265 



Mill: : I I I 



RESULT 14 
G2GP 

Ig gamma-2 chain C region - guinea pig 
C;Species: Cavia porcellus (guinea pig) 

C;Date: 07-May-1981 #sequence_revision 07-May-1981 #text_change 16-Jul-1999 
C;Accession: A94553; A90352; A90359; A90384; A90385; A02151 
R;Trischmann, T . M. 

submitted to the Atlas, April 1975 
A;Reference number: A94553 
A; Accession: A94553 
A;Molecule type: protein 



A; Residues: 1-3 <TRI> 

R;Birshtein, B.K.; Hussain, Q.Z.; Cebra, J.J. 
Biochemistry 10, 18-25, 1971 

A;Title: Structure of heavy chain from strain 13 guinea pig immunoglobulin-G ( 2 ) . 
III. Amino acid sequence of the region around the half-cystine joining heavy and 
light chains. 

A;Reference number: A90352; MUID : 71058471 ; PMID:5538606 

A; Accession: A90352 

A;Molecule type: protein 

A; Residues: 4-68 <BIR> 

R;Turner, K.J.; Cebra, J.J. 

Biochemistry 10, 9-17, 1971 

A;Title: Structure of heavy chain from strain 13 guinea pig immunoglobulin-G ( 2 ) . 
II. Amino acid sequence of the carboxyl-terminal and hinge region cyanogen 
bromide fragments . 

A;Reference number: A90359; MUID : 71058486; PMID:5538616 

A; Accession: A90359 

A;Molecule type: protein 

A;Residues: 69-133/312-32 9 <TUR> 

R;Tracey, D.E.; Cebra, J.J. 

Biochemistry 13, 4796-4803, 1974 

A; Title: Primary structure of the C-H2 homology region from guinea pig IgG2 
antibodies . 

A; Reference number: A90384; MUID : 75036072 ; PMID: 4429665 

A;Accession: A90384 

A;Molecule type: protein 

A; Residues: 134-226 <TRA> 

R;Trischmann, T.M.; Cebra, J.J. 

Biochemistry 13, 4804-4811, 1974 

A; Title: Primary structure of the C-H3 homology region from guinea pig IgG2 
antibodies . 

A;Reference number: A90385; MUID : 75036073 ; PMID:4609467 

A; Accession: A903 85 

A;Molecule type: protein 

A; Residues: 227-311 <TR2> 

R;01iveira, B. ; Lamm, M.E. 

Biochemistry 10, 26-31, 1971 

A; Title: Interchain disulfide bridges of guinea pig gamma-2- immunoglobulin. 
A; Reference number: A90354; MUID : 71058474 ; PMID: 4922544 
A; Contents: annotation; disulfide bonds 

A; Note: Cys-16 is involved in a heavy-light chain bond 

A;Note: Cys-105, Cys-107, and Cys-110 form inter-heavy chain bonds 

C; Comment: This chain was isolated from pooled serum of strain 13 inbred guinea 

pigs . 

C; Complex: An immunoglobulin heterotetramer subunit consists of two identical 

light (kappa or lambda) and two identical heavy (alpha, delta, epsilon, gamma, 

or mu) chains usually stabilized by interchain disulfide bonds. In some cases, 

such as IgA and IgM, the subunits associate into larger oligomers. 

C; Superf amily: immunoglobulin C region; immunoglobulin homology 

C; Keywords : duplication; glycoprotein; heterotetramer ; immunoglobulin 

F; 21-81/Domain: immunoglobulin homology <IM1> 

F; 135-204/Domain: immunoglobulin homology <IM2> 

F; 2 4 1-3 10/ Domain : immunoglobulin homology <IM3> 

F;28-79/Disulfide bonds: ((status experimental 

F;142-202/Disulfide bonds: ((status experimental 

F;178/Binding site: carbohydrate (Asn) (covalent) ((status experimental 
F;248-308/Disulfide bonds: ((status experimental 



Query Match 55.0%; Score 903.5; DB 1; Length 329; 

Best Local Similarity 65.8%; Pred. No. 3.4e-58; 

Matches 171; Conservative 27; Mismatches 49; Indels 13; Gaps 2 

QY 55 TPPLTCQRYCNASVTNSVKGVDKT HTCPPCPAPELLGGPSVFLFPPKPK 103 

II:: II: I Mil I I I I I II I I I I I I I : I I I I I I 

Db 70 TVPSSQKATCNVAHPASSTKVDKTVEPIRTPZPBPCTCPKCPPPENLGGPSVFIFPPKPK 129 

QY 104 DTLMISRTPEVTCVWDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRWSVLTV 163 

I I I I I I M I I I I I I I II : : I I I : I I : I I I II : I I M I I I I : I : I I | | | : 
Db 130 DTLMISLTPRVTCVWDVSQDEPEVQFTWFVDNKPVGNAETKPRVEQYNTTFRVESVLPI 189 

Qy 164 LHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCL 223 

Mill II I : I I I I I I I I I I I I I I I I I I II || | I I I I I M I I I I : I :: II : I I I 
Db 190 QHQDWLRGKEFKCKVTNKALPAPIEKTISKTKGAPRMPDVYTLPPSRDELSKSKVSVTCL 249 

Qy 224 VKGFYPSDIAVEWESNGQP — ENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCS 281 

: I : I : I I I I I I I I I I I I I I : I : I I I : I I I I I M I I M I I I I : : M 
Db 250 IINFFP7VDIHVEWASNRVPVSEKEYKNTPPIEDADGSYFLYSKLTVDKSAWDQGTVYTCS 309 

Qy 2 82 VMHEALHNHYTQKSLSLSPG 301 

I I I I I I I I I I I I : : I III 
Db 310 VMHEALHNHVTQKAI S RS PG 329 



RESULT 15 
147158 

Ig gamma 1 chain constant region - pig (fragment) 
C; Species: Sus scrofa domestica (domestic pig) 

C;Date: 21-Feb-1997 #sequence_revision 21-Feb-1997 #text_change 21-Jan-2000 
C; Accession: 147158 

R; Kacskovics , I.; Sun, J.; Butler, J.E. 
J. Immunol. 153, 3565-3573, 1994 

A;Title: Five putative subclasses of swine IgG identified from the cDNA 
sequences of a single animal. 

A; Reference number: 147158; MUID : 95015845 ; PMID:7930579 
A; Accession : 14 7158 

A; Status: preliminary; translated from GB/EMBL/DDB J 
A; Molecule type: mRNA 
A; Residues: 1-328 <KAC> 

A;Cross-references : EMBL:U03778; NID:g433121; PIDN : AAA522 16 . 1 ; PID:g433122 
C; Genetics : 
A; Gene : IgGl 

C; Superfamily : immunoglobulin C region; immunoglobulin homology 
F; 133-202/Domain : immunoglobulin homology <IMM> 

Query Match 54.7%; Score 898.5; DB 2; Length 328; 

Best Local Similarity 59.6%; Pred. No. 7.8e-58; 

Matches 180; Conservative 39; Mismatches 64; Indels 19; Gaps 7; 

Qy 10 VLLLWVPGS-TGDV TMLQMAGQCSQNEYFDSLLHACI PCQLRCSSNTPPLTCQRYC 64 

I : I I : I I : : | | : | | : : | | : : | | 

Db 37 VT VTWN S GALT S GVHT FP S VLQP S GL YS L S SM VTVPAS SLSSKSYTCNVNH 87 

Qy 65 NASVTNSVK — GVDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVWDVS 122 

Ml I I : : I I I I I I : I I I I I : I I I I I I I I I I I I : II I I I II II I I I 



Db 88 PATTTKVDKRVGIHQPQTCPICPGCE-VAGPSVFIFPPKPKDTLMISQTPEVTCVWDVS 14 6 

QY 123 HEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKA 182 

I I I : I : I I I I I I I I I I : I : I : I I I : M I I I I I I I I : I I I I I I I I : I I I I : I 
Db 147 KEHAEVQFSWYVDGVEVHTAETRPKEEQFNSTYRWSVLPIQHQDWLKGKEFKCKVNNVD 2 06 

QY 183 LPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQ- 241 

Mill = I I I I f M IIIIIIMII :||::::|:||||| ||| : ||, 

Db 207 LPAPITRTISKAIGQSREPQVYTLPPPAEELSRSKVTLTCLVIGFYPPDIHVEWKSNGQP 266 

Qy 242 -PENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSP 300 

Ml I : I I I I I M : M I I I I I 111:11 I : I I : I II I I I I I I I I I I I : I : 
Db 2 67 EPENTYRTTPPQQDVDGTFFLYSKLAVDKARWDHGDKFECAVMHEALHNHYTQKSISKTQ 32 6 

Qy 301 GK 302 

I I 

Db 327 GK 328 

Search completed: April 19, 2004, 13:23:36 
Job time : 17.7778 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 

Run on: April 19, 2004, 13:21:03 ; Search time 47.2263 Seconds 

(without alignments ) 
1762.857 Million cell updates/sec 

Title: PCT-US 03-0514 7-3 

Perfect score : 1643 

Sequence: 1 METDTLLLWVLLLWVPGSTG MHEALHNHYTQKSLSLSPGK 302 

Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 1124875 seqs, 275673149 residues 

Total number of hits satisfying chosen parameters: 1124875 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database : Published_Applications AA: * 

1 : /cgn2_6/ptodata/2/pubpaa/US07_PUBCOMB . pep : * 

2 : / cgn2_6/ptodata/2/pubpaa/PCT_NEW_PUB.pep: * 

3 : / cgn2_6/ptodata/2/pubpaa/US06_NEW_PUB.pep: * 

4 : /cgn2_6/ptodata/2/pubpaa/US06_PUBCOMB.pep : * 

5 : / cgn2_6/ptodata/2/pubpaa/US07_NEW_PUB.pep: * 

6: /cgn2_6/ptodata/2/pubpaa/PCTUS_PUBCOMB.pep: * 

7 : /cgn2_6/ptodata/2/pubpaa/US0 8_NEW_PUB.pep : * 

8 : /cgn2_6/ptodata/2/pubpaa/US0 8_PUBCOMB.pep : * 

9 : /cgn2_6/ptodata/2/pubpaa/US09A_PUBCOMB.pep : * 
10 : /cgn2_6/ptodata/2/pubpaa/US09B_PUBCOMB.pep: * 
11 : /cgn2_6/ptodata/2/pubpaa/US09C_PUBCOMB.pep: * 
12 : / cgn2_6/ptodata/2/pubpaa/US09_NEW_PUB.pep : * 
13 : / cgn2_6/ptodata/2/pubpaa/US10A_PUBCOMB.pep: * 
14 : / cgn2_6/ptodata/2/pubpaa/US10B_PUBCOMB.pep : * 
15 : / cgn2_6/ptodata/2/pubpaa/US10C__PUBCOMB.pep : * 
16: /cgn2_6/ptodata/2/pubpaa/US10_NEW_PUB.pep: * 
17 : / cgn2_6/ptodata/2/pubpaa/US60__NEW_PUB.pep : * 
18 : / cgn2_6/ptodata/2/pubpaa/US60_PUBCOMB.pep: * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



Result Query 

No. Score Match Length DB ID Description 
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.6 
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12 
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.1 
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9 


US-09-854-864-10 
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.1 
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12 
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6 
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.7 
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15 
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12 
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9 
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82 


.8 
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12 


US-10-411-026-50 


10 


1276.5 


77 


.7 
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14 


US-10-193-616-14 


11 
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77 


.4 
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14 


US-10-152-363A-62 


12 


1269 


77 


.2 
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9 


US-09-792-200B-10 


13 
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77 


. 0 
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9 


US-09-792-200B-6 


14 
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77 


.0 
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15 


US-10-622-407-14 


15 
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77 


. 0 
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15 


US-10-258-368-15 


16 
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76 
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14 


US-10-152-363A-54 


17 
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76 


.7 
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14 


US-10-363-427-10 


18 
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76 
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9 


US-09-854-864-18 
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76 


.7 
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12 


US-09-855-158-18 
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76 


.6 
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14 


US-10-077-023-133 


21 
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76 


6 
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14 
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22 
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76 


6 
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9 


US-09-875-338-9 


23 
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76 


6 
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14 


US-10-077-023-9 


24 
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76 


6 
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14 


US-10-152-363A-52 


25 


1257.5 


76. 


5 


360 


9 


US-09-949-713-11 


26 


1257.5 


76. 


5 
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9 


US-09-949-713-22 


27 


1257 . 5 


76. 


5 
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14 


US-10-084-139-10 


28 


1254 


76. 


3 
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15 


US-10-390-566-21 


29 


1254 


76. 


3 


266 


15 


US-10-390-566-28 


30 


1254 


76. 


3 
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9 


US-09-792-200B-2 


31 
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76. 


3 
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14 


US-10-152-363A-56 


32 
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76. 


2 
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15 


US-10-390-566-8 


33 
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2 
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15 
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34 
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2 
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14 
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35 
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2 
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14 
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2 
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9 


US-09-792-200B-14 


37 
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10 


US-09-389-782-5 


38 
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1 
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12 
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39 


1249 


76. 


0 


252 


14 
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1249 


76. 


0 
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9 


US-09-792-200B-22 


41 
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76. 


0 
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12 


US-10-633-202-3 


42 


1248 
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10 
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43 


1248 


76. 


0 
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14 
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44 
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76. 


0 
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14 
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45 
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9 
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9 


US-09-948-018-36 



Sequence 12, Appl 

Sequence 9, Appli 
Sequence 9, Appli 

Sequence 10, Appl 
Sequence 10, Appl 
Sequence 8, Appli 
Sequence 6, Appli 
Sequence 50, Appl 
Sequence 50, Appl 
Sequence 14, Appl 
Sequence 62, Appl 

Sequence 10 , Appl 

Sequence 6, Appli 
Sequence 14 , Appl 
Sequence 15, Appl 
Sequence 54, Appl 
Sequence 10, Appl 

Sequence 18, Appl 
Sequence 18, Appl 
Sequence 133, App 
Sequence 135, App 

Sequence 9, Appli 
Sequence 9, Appli 
Sequence 52, Appl 

Sequence 11, Appl 

Sequence 22, Appl 
Sequence 10, Appl 
Sequence 21, Appl 
Sequence 28, Appl 

Sequence 2, Appli 
Sequence 56, Appl 
Sequence 8, Appli 
Sequence 15, Appl 
Sequence 2, Appli 
Sequence 6, Appli 

Sequence 14, Appl 
Sequence 5, Appli 
Sequence 4, Appli 
Sequence 121, App 

Sequence 22, Appl 
Sequence 3 , Appli 
Sequence 14, Appl 
Sequence 14, Appl 
Sequence 5, Appli 

Sequence 36, Appl 



ALIGNMENTS 



RESULT 1 

US-10-115-192-12 

; Sequence 12, Application US/10115192 

; Publication No. US20030082 175A1 

; GENERAL INFORMATION: 

; APPLICANT: Apotech R&D S.A. 



; APPLICANT: Biogen, Inc. 

; TITLE OF INVENTION: April Receptor (BCMA) and Uses Thereof 
; FILE REFERENCE: A083PCT 

; CURRENT APPLICATION NUMBER: US/10/115,192 

; CURRENT FILING DATE: 2002-04-02 

; PRIOR APPLICATION NUMBER: 60/215688 

; PRIOR FILING DATE: 2000-06-30 

; PRIOR APPLICATION NUMBER: 60/181807 

; PRIOR FILING DATE: 2000-02-11 

; PRIOR APPLICATION NUMBER: 60/157933 

; PRIOR FILING DATE: 1999-10-06 

; NUMBER OF SEQ ID NOS : 12 

; SOFTWARE: FastSEQ for Windows Version 4.0 
; SEQ ID NO 12 
; LENGTH: 3 02 

TYPE: PRT 
; ORGANISM: homo sapiens 
US-10-115-192-12 



Query Match 100.0%; Score 1643; DB 14; Length 302; 

Best Local Similarity 100.0%; Pred. No. 1.3e-125; 

Matches 302; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 METDTLLLWVLLLWVPGSTGDVTMLQMAGQCSQNEYFDSLLHACIPCQLRCSSNTPPLTC 60 

M I I I I I I I I I I I I I I 1 I I I I M | | | | | | | M I I I I I I I I I I I I M I I I I I II I I I I I I I 
Db 1 METDTLLLWVLLLWVPGSTGDVTMLQMAGQCSQNEYFDSLLHACIPCQLRCSSNTPPLTC 60 

Qy 61 QRYCNASVTNSVKGVDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVWD 120 

I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 QRYCNASVTNSVKGVDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVWD 120 



QY 121 VSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSN 180 

M I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I | | | | | | | | | | | | | | | 
Db 121 VSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSN 180 

Qy 181 KALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNG 240 

I M I I II I I I I I I I I I I I I I I I I I | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | || | | | | 

Db 181 KALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNG 240 

Qy 241 QPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSP 300 

I I N I I I I I I I I I I I I I I I I I I I I I I I I 1 I M I I I I I I I I I I I I I I I I I I I I I I I 

Db 241 QPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSP 300 

Qy 301 GK 302 

I I 

Db 301 GK 302 



RESULT 2 
US-09-854-864-9 

; Sequence 9, Application US/09854864 

; Patent No. US20020081296A1 

; GENERAL INFORMATION: 

; APPLICANT: THEILL, LARS EYDE 

; APPLICANT: YU, GANG 

; TITLE OF INVENTION: METHODS AND COMPOSITIONS OF MATTER CONCERNING APRIL/G70 , 
BCMA, 



; TITLE OF INVENTION: BLYS/AGP-3, AND TACI 
; FILE REFERENCE: A-686B 

; CURRENT APPLICATION NUMBER: US/ 0 9/ 854 , 8 64 

; CURRENT FILING DATE: 2001-09-11 

; PRIOR APPLICATION NUMBER: US 60/204,039 

; PRIOR FILING DATE: 2000-05-12 

; PRIOR APPLICATION NUMBER: US 60/214,591 

; PRIOR FILING DATE: 2000-06-27 

; NUMBER OF SEQ ID NOS : 31 

SOFTWARE: Patent In version 3.1 
; SEQ ID NO 9 

LENGTH: 2 83 
; TYPE: PRT 

ORGANISM: Homo sapiens 
US-09-854-864-9 

Query Match 90.6%; Score 1488.5; DB 9; Length 283; 

Best Local Similarity 97.2%; Pred* No. 4.6e-113; 

Matches 275; Conservative 0; Mismatches 1; Indels 7; Gaps 1 

Qy 27 MAGQCSQNEYFDSLLHACIPCQLRCSSNTPPLTCQRYCNASVTNSVKGV DKTH 79 

I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I | | | M 

Db 1 MAGQCSQNEYFDSLLHACIPCQLRCSSNTPPLTCQRYCNASVTNSVKGTNAGGGGGDKTH 60 

QY 80 TCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVWDVSHEDPEVKFNWYVDGVEV 139 

I M I M I M M I II I I I I I I I I I I I I I || | | | | | | | | M | | | | | | | | | | | | | | | | | || | | 
Db 61 TCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVWDVSHEDPEVKFNWYVDGVEV 120 

QY 140 HNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPR 199 

I I I M I II I I I I I I I I I I I II I I I I II I I I I I I I I I || | | | | | | || | | | | || | | || | | | | 
Db 121 HNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPR 180 

QY 200 EPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSF 259 

I I M I II I I I I I I I I I I || I I I I | | | | | | | | | | | | | | | | | | || | | || | | | | | | | | | | | | | 
Db 181 EPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSF 240 

QY 2 60 FLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 302 

I M I I I I I I I I I II I I I II I I I I I I II II I I I I I I I I I I I I I I 
Db 241 FLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 2 83 



RESULT 3 
US-09-855-158-9 

; Sequence 9, Application US/09855158 

; Publication No. US2 0020086018A1 

; GENERAL INFORMATION: 

; APPLICANT: THEILL, LARS EYDE 

; APPLICANT : YU, GANG 

; TITLE OF INVENTION: METHODS AND COMPOSITIONS OF MATTER CONCERNING APRIL/G70, 
BCMA, BLYS/AGP- 

; TITLE OF INVENTION: 3, AND TACI 
; FILE REFERENCE: A-686A 

; CURRENT APPLICATION NUMBER: US/09/855,158 

; CURRENT FILING DATE: 2001-09-11 

; PRIOR APPLICATION NUMBER: US 60/214,591 

; PRIOR FILING DATE: 2000-06-27 

; PRIOR APPLICATION NUMBER: US 60/204,039 



; PRIOR FILING DATE: 2000-05-12 
; NUMBER OF SEQ ID NOS : 30 

SOFTWARE: Patentln version 3.1 
; SEQ ID NO 9 

LENGTH: 283 
; TYPE: PRT 

ORGANISM: Homo sapiens 
US-09-855-158-9 

Query Match 90.6%; Score 1488.5; DB 12; Length 283; 

Best Local Similarity 97.2%; Pred. No. 4.6e-113; 

Matches 275 ; Conservative 0; Mismatches 1 ; Indels 7 ; Gaps 1 
Qy 27 MAGQCSQNEYFDSLLHACIPCQLRCSSNTPPLTCQRYCNASVTNSVKGV DKTH 7 9 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I II I I I I I I I I 

Db 1 MAGQCSQNEYFDSLLHACIPCQLRCSSNTPPLTCQRYCNASVTNSVKGTNAGGGGGDKTH 60 

Qy 80 TCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVWDVSHEDPEVKFNWYVDGVEV 139 

II I I 1 I II I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I II I I M I I I I 1 I I I I II I I I I I I I 

Db 61 T C P P C PAP EL L GGP S VFL FP P K P KDT LMI S RT P EVT C VWD VS HE D P E VK FN W YVDGVEV 12 0 

Qy 140 HNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPR 199 

I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I 
Db 121 HNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPR 18 0 

Qy 2 00 EPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSF 259 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 181 EPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSF 240 

Qy 260 FLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 302 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 241 FLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 283 



RESULT 4 

US-09-854-864-10 

; Sequence 10, Application US/09854864 

; Patent No. US2 0020081296A1 

; GENERAL INFORMATION: 

; APPLICANT: THEILL, LARS EYDE 

; APPLICANT : YU, GANG 

; TITLE OF INVENTION: METHODS AND COMPOSITIONS OF MATTER CONCERNING APRIL/ G70 
BCMA, 

TITLE OF INVENTION: BLYS/AGP-3, AND TACI 
; FILE REFERENCE: A-686B 

; CURRENT APPLICATION NUMBER: US/ 09/854, 8 64 

; CURRENT FILING DATE: 2001-09-11 

; PRIOR APPLICATION NUMBER: US 60/204,039 

; PRIOR FILING DATE: 2000-05-12 

; PRIOR APPLICATION NUMBER: US 60/214,591 

; PRIOR FILING DATE: 2000-06-27 

; NUMBER OF SEQ ID NOS: 31 

; SOFTWARE : Patentln version 3.1 

; SEQ ID NO 10 

LENGTH: 281 

TYPE: PRT 
; ORGANISM: Mus mus cuius 



US-09-854-864-10 



Query Match 85.1%; Score 1398.5; DB 9; Length 281; 

Best Local Similarity 92.2%; Pred. No. 9.7e-106; 

Matches 2 61; Conservative 4; Mismatches 9; Indels 9; Gaps 2 

Qy 2 7 MAGQCSQNEYFDSLLHACIPCQLRCSSNTPPLTCQRYCNASWNSVKGV DKTH 7 9 

I I II : I I I I I I I I I I I I I I I I : I I I I I I I : I I I : I I I I III 
Db 1 MAQQCFHSEYFDSLLHACKPCHLRCSN — PPATCQPYCDPSVTSSVKGSYTGGGGGDKTH 58 

Qy 8 0 TCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVWDVSHEDPEVKFNWYVDGVEV 139 

I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I II I I I I I I I I I I I II I I II I I I I I | I I I 
Db 59 TCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVWDVSHEDPEVKFNWYVDGVEV 118 

Qy 140 HNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPR 199 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I II I I I I I I I I I I I I I I I I I I I I I 
Db 119 HNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPR 178 

Qy 200 EPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSF 259 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 17 9 EPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSF 238 

Qy 2 60 FLYS KLTVDKS RWQQGNVFS C S VMHEALHNH YTQKS L S LS P GK 302 

I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I 
Db 239 FLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 281 



RESULT 5 

US-09-855-158-10 

; Sequence 10, Application US/09855158 

; Publication No. US2002008 6018A1 

; GENERAL INFORMATION: 

; APPLICANT: THEILL, LARS EYDE 

; APPLICANT : YU„ GANG 

; TITLE OF INVENTION: METHODS AND COMPOSITIONS OF MATTER CONCERNING APRIL/G70 
BCMA, BLYS/AGP- 

; TITLE OF INVENTION: 3, AND TACI 
; FILE REFERENCE: A-68 6A 

; CURRENT APPLICATION NUMBER: US/ 09/ 855 , 15 8 
; CURRENT FILING DATE: 2001-09-11 
; PRIOR APPLICATION NUMBER: US 60/214,591 
; PRIOR FILING DATE: 2000-06-27 

PRIOR APPLICATION NUMBER: US 60/204,039 
; PRIOR FILING DATE: 2000-05-12 
; NUMBER OF SEQ ID NOS : 30 
; SOFTWARE: Patentln version 3.1 
; SEQ ID NO 10 

LENGTH: 2 81 

TYPE: PRT 
; ORGANISM: Mus mus cuius 
US-09-855-158-10 

Query Match 85.1%; Score 1398.5; DB 12; Length 281; 

Best Local Similarity 92.2%; Pred. No. 9.7e-106; 

Matches 2 61; Conservative 4; Mismatches 9; Indels 9; Gaps 2 



Qy 



27 MAGQCSQNEYFDSLLHACIPCQLRCSSNTPPLTCQRYCNASVTNSVKGV DKTH 79 



Db 


1 


Ov 


80 


Db 


59 


Qv 


140 


Db 


119 


Qy 


200 


Db 


179 


Qy 


260 


Db 


239 



I I I I : I I I I I I I II I I I I I I I : II I I I I I : I I I : I I I I | | | | 

MAQQCFHSEYFDSLLHACKPCHLRCSN—PPATCQPYCDPSVTSSVKGSYTGGGGGDKTH 58 



I I I I I M I I I I I I I I I I I II I I I I I I I I II I II I I I II I I I I I I I I I I | | | | | | | | | | | | 



HNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPI EKTI SKAKGQPR 19 9 
I I I I I I I I I I I I I I I I I I I I I II I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 



EPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSF 259 
I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I | I I I I I I I I I I I I I I I I I I I I I I I I 
EPQWTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSF 238 



I I I I I I I I I II I I I I I I I I I I I I I I I I I II I I II I I I I I I II I 



RESULT 6 
US-10-258-368-8 

; Sequence 8, Application US/10258368 
; Publication No. US20040013674A1 
; GENERAL INFORMATION: 
; APPLICANT: Ambrose, Christine 

APPLICANT: Thompson, Jeffrey 
; APPLICANT: Schneider, Pascal 
; APPLICANT: Rennert, Paul 

TITLE OF INVENTION: Taci As As Anti-Tumor Agent 

FILE REFERENCE: BIOG0130 NP of PCT/US01/40626 
; CURRENT APPLICATION NUMBER: US/10/258 , 368 

CURRENT FILING DATE: 2002-10-21 
; PRIOR APPLICATION NUMBER: USSN 60/199,946 
; PRIOR FILING DATE: 2000-04-27 
; NUMBER OF SEQ ID NOS : 17 

SOFTWARE: Patentln version 3.1 
; SEQ ID NO 8 

LENGTH: 334 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-10-258-368~8 



Query Match 83.9%; Score 1379; DB 15; Length 334; 

Best Local Similarity 77.1%; Pred. No. 4.7e-104; 

Matches 262; Conservative 10; Mismatches 24; Indels 44; Gaps 3; 



Qy 1 METDTLLLWVLLLWVPGSTGDVTMLQMAGQCSQNEYFDSLLHACIPCQLRCSSNTP 56 

I I I I I II I I I I I I I I I I I I I I I I : : I : I I I I : I : I : : 

Db 1 METDTLLLWVLLLWVPGSTGDVR SCPEEQYWDPLLGTCMSCKTICNHQSQRTCA 54 

Qy 57 PLTCQRYCNASVTNSVK-GVDKTHTCP 82 

I I : I : : I I I I I I I I I 
Db 55 AFCRSLSCRKEQGKFYDHLLRDCISCASICGQHPKQCAYFCENKLRSPVNLPVDKTHTCP 114 



Qy 83 PCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVWDVSHEDPEVKFNWYVDGVEVHNA 142 

I I I I I I I I 1 1 I M I I I I I I I I I I I I I I I I I I | | | | | | | | | | || | | M I I I I I I I II I I I I 



Db 



115 PCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCWVDVSHEDPEVKFNWYVDGVEVHNA 174 



Qy 



143 KTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQ 202 




Db 



175 KTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQ 234 



Qy 



2 03 VYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLY 2 62 




Db 



235 VYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLY 294 



Qy 



2 63 SKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 3 02 




Db 



2 95 S KLTVDKS RWQQGNVFSCS VMHEALHNHYTQKS LS LS PGK 334 



RESULT 7 
US-10-258-368-6 

; Sequence 6, Application US/10258368 

; Publication No. US2 0 04 0 013 674A1 

; GENERAL INFORMATION : 

; APPLICANT: Ambrose, Christine 

; APPLICANT: Thompson, Jeffrey 

; APPLICANT: Schneider, Pascal 

; APPLICANT: Rennert, Paul 

; TITLE OF INVENTION: Taci As As Anti-Tumor Agent 

FILE REFERENCE: BIOG0130 NP of PCT/US01/ 4 0 62 6 

CURRENT APPLICATION NUMBER: US/ 1 0/2 58 , 368 
; CURRENT FILING DATE: 2002-10-21 
; PRIOR APPLICATION NUMBER: USSN 60/199,946 
; PRIOR FILING DATE: 2000-04-27 
; NUMBER OF SEQ ID NOS : 17 

SOFTWARE: Patentln version 3.1 
; SEQ ID NO 6 

LENGTH: 366 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-10-258-368-6 

Query Match 83.7%; Score 1376; DB 15; Length 366; 

Best Local Similarity 72.1%; Pred. No. 9.2e-104; 

Matches 264; Conservative 11; Mismatches 27; Indels 64; Gaps 3; 

Qy 1 METDTLLLWVLLLWVPGSTGDVTMLQMA GQCSQN 34 

I I I 1 I I 1 I I I I I II I I I I I I I I I I : | : 

Db 1 METDTLLLWVLLLWVPGSTGDVTMSGLGRSRRGGRSRVDQEERFPQGLWTGVAMRSCPEE 60 

Qy 35 EYFDSLLHACIPCQLRCSSNTP P 57 

■ : I : I I I I : I : I : : | 
Db 61 QYWDPLLGTCMSCKTICNHQSQRTCAAFCRSLSCRKEQGKFYDHLLRDCISCASICGQHP 120 

Qy 58 LTCQRYCNASVTNSVK-GVDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTC 116 

I : I : : I I I II I I I I I I M I M I I I I I I I M I I II I I I II I II M I I I I 
Db 121 KQCAYFCENKLRSPVNLPVDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTC 180 

Qy 117 WVDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKC 176 

M I M I I I I I I I I I I I I I | | | | | | | | | I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 181 VWDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKC 24 0 



Qy 177 KVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEW 236 

I M I I M I I I I I I I I I I I I I I I I I I I I | I | I i I I I I I I I | II | | | | | | | | M I I I I I I II 

Db 241 KVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEW 300 

Qy 237 ESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSL 296 

I i i I I I I I I I I I I I 1 I I I I I I I I i I 1 I I I I I II 1 I I I I I I I I I I I I I I I I I I I || | | | | | 
301 ESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSL 360 

Qy 297 SLSPGK 302 

I I I II I 

Db 361 SLSPGK 366 



RESULT 8 

US-10-411-037-50 

Sequence 50, Application US/10411037 
Publication No. US20040043446A1 
GENERAL INFORMATION: 
APPLICANT: Neose Technologies, Inc. 
APPLICANT : DeFrees , Shawn 
APPLICANT: Zopf, David 
APPLICANT: Bayer, Robert 
APPLICANT: Hakes, David 
APPLICANT: Chen, Xi 
APPLICANT : Bowe, Caryn 

TITLE OF INVENTION: ALPHA GALACTOSIDASE A: REMODELING AND GLYCOCON JUGATION OF 
ALPHA 

TITLE OF INVENTION: GALACTOSIDASE A 
FILE REFERENCE: 04 0853-01-5082 
CURRENT APPLICATION NUMBER: US/ 1 0/ 4 1 1 , 037 
CURRENT FILING DATE: 2003-04-09 
PRIOR APPLICATION NUMBER: US 60/328,523 
PRIOR FILING DATE: 2001-10-10 
PRIOR APPLICATION NUMBER: US 60/344,692 
PRIOR FILING DATE: 2001-10-19 
PRIOR APPLICATION NUMBER: US 60/387,292 
PRIOR FILING DATE: 2002-06-07 
PRIOR APPLICATION NUMBER: US 60/391,777 
PRIOR FILING DATE: 2002-06-25 
PRIOR APPLICATION NUMBER: US 60/396,594 
PRIOR FILING DATE: 2002-07-17 
PRIOR APPLICATION NUMBER: US 60/404,249 
PRIOR FILING DATE: 2002-08-16 
PRIOR APPLICATION NUMBER: US 60/407,527 
PRIOR FILING DATE: 2002-08-28 
NUMBER OF SEQ ID NOS : 75 
SOFTWARE: Patentln version 3.2 
SEQ ID NO 50 
LENGTH: 32 0 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-10-411-037-50 



Query Match 82.8%; Score 1361; DB 12; 

Best Local Similarity 82.2%; Pred. No. 1.3e-102; 
Matches 263; Conservative 5; Mismatches 34; 



Length 32 0; 
Indels 18; Gaps 



3; 



Qy 


1 


METDTLLLWLLLWVPGSTGDVTM — LQMAGQ CSQNEYFDSLLHACIPCQL 


4 9 


Db 


1 


II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 : 1 : I MM: 1 : 1 1 
METDTLLLWVLLLWVPGSTGDVRRGPRSLRGRDAPAPTPCVPAECFDLLVRHCVACGLLR 


60 


Qy 


50 


RCSSNTPPLTCQRYCNASVTNSVKGVDKTHTCPPCPAPELLGGPSVFLFPPKP 

II 1 1 : 1 1 1 1 1 1 II 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
TPRPKPAGASSPAPRTALQPQESVGAGAGEAAVDKTHTCPPCPAPELLGGPSVFLFPPKP 


102 


Db 


61 


12 0 


Qy 

Db 


103 
121 


KDTLMISRTPEWCVWDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRVVSVLT 
1 M 1 1 1 1 1 II 1 1 II 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I I | | M 1 1 1 1 1 1 1 1 1 1 1 
KDTLMISRTPEVTCVWDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRWSVLT 


162 
180 


Qy 


163 


VLHQDWLNGKEYKCKVSNKA1PAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTC 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 | M M 1 1 1 1 1 1 1 1 1 1 1 
VLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTC 


z. z. 


Db 


181 


9 A fl 
*C H. U 


Qy 

Db 


223 
241 


LVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSV 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 I I I I I I I I I I 
LVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSV 


282 
300 


Qy 


283 


MHEALHNHYTQKSLSLSPGK 302 
1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 
MHEALHNHYTQKSLSLSPGK 32 0 




Db 


301 





RESULT 9 

US-10-411-026-50 

Sequence 50, Application US/10411026 
Publication No. US2 004 0063911A1 
GENERAL INFORMATION: 
APPLICANT: Neose Technologies, Inc. 
APPLICANT : DeFrees , Shawn 
APPLICANT: Zopf, David 
APPLICANT: Bayer, Robert 
APPLICANT: Hakes, David 
APPLICANT: Chen, Xi 

TITLE OF INVENTION: PROTEIN REMODELING METHODS AND PROTEINS/PEPTIDES PRODUCED 
BY THE 

TITLE OF INVENTION: METHODS 
FILE REFERENCE: 04 08 53-01-5 053 
CURRENT APPLICATION NUMBER: US/10/4 11 , 026 
CURRENT FILING DATE: 2003-04-09 
PRIOR APPLICATION NUMBER: US 60/328,523 
PRIOR FILING DATE: 2001-10-10 
PRIOR APPLICATION NUMBER: US 60/344,692 
PRIOR FILING DATE: 2001-10-19 
PRIOR APPLICATION NUMBER: US 60/387,292 
PRIOR FILING DATE: 2002-06-07 
PRIOR APPLICATION NUMBER: US 60/391,777 
PRIOR FILING DATE: 2002-06-25 
PRIOR APPLICATION NUMBER: US 60/396,594 
PRIOR FILING DATE: 2002-07-17 
PRIOR APPLICATION NUMBER: US 60/404,249 
PRIOR FILING DATE: 2002-08-16 
PRIOR APPLICATION NUMBER: US 60/407,527 
PRIOR FILING DATE: 2002-08-28 



; NUMBER OF SEQ ID NOS : 75 

; SOFTWARE: Patentln version 3.2 

; SEQ ID NO 50 

LENGTH: 32 0 

TYPE: PRT 
; ORGANISM: Homo sapiens 
US-10-411-026-50 

Query Match 82.8%; Score 1361; DB 12; Length 320; 

Best Local Similarity 82.2%; Pred. No. 1.3e-102; 

Matches 263; Conservative 5; Mismatches 34; Indels 18; Gaps 3 

Qy 1 METDTLLLWVLLLWVPGSTGDVTM — LQMAGQ CSQNEYFDSLLHACIPCQL — 4 9 

I I I I I I I II I I I I I I I I I I I I I : I : I I || I : I : I I 

Db 1 METDTLLLWVLLLWVPGSTGDVRRGPRSLRGRDAPAPTPCVPAECFDLLVRHCVACGLLR 60 

QY 50 RCSSNTPPLTCQRYCNASVTNSVKGVDKTHTCPPCPAPELLGGPSVFLFPPKP 102 

M I I : I I I I I I I I I M I II I I I I I I I I I I I I I I 

Db 61 TPRPKPAGASSPAPRTALQPQESVGAGAGEAAVDKTHTCPPCPAPELLGGPSVFLFPPKP 120 

Qy 103 KDTLMISRTPEVTCVWDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRWSVLT 162 

I I I I I I I I I M I I I I I I II I I I I I I I I I I I || | | | | | | | | | | | | | | | | | | | | | | | | | | | | 
Db 121 KDTLMISRTPEVTCVWDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRWSVLT 180 

Qy 163 VLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTC 222 

I I I M II I I I I I I I I I I I I I I I I I I I || I I I I I I I || | | | | | | | || | | | | | | | | | | | | || 
Db 181 VLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTC 240 

Qy 223 LVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSV 2 82 

M I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I II I I I I I I I 
Db 241 LVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSV 300 

Qy 283 MHEALHNHYTQKSLSLSPGK 302 

I II I I I I I I I I II I I I I I I 1 
Db 301 MHEALHNHYTQKSLSLSPGK 320 



RESULT 10 
US-10-193-616-14 

; Sequence 14, Application US/10193616 
; Publication No. US20030096355A1 
; GENERAL INFORMATION: 
; APPLICANT: Zhang, Ke 

; TITLE OF INVENTION: Isolation, Identification, and Characterization of 
; TITLE OF INVENTION: ymkzS, a novel 

TITLE OF INVENTION: member of the TNF-Receptor Supergene Family 
; FILE REFERENCE: 01017/35551A 

CURRENT APPLICATION NUMBER: US/10/193 , 616 
; CURRENT FILING DATE: 2002-07-11 
; PRIOR APPLICATION NUMBER: US/09/611,989 
; PRIOR FILING DATE: 2000-07-07 
; PRIOR APPLICATION NUMBER: US 60/143,137 
; PRIOR FILING DATE: 1999-07-07 
; NUMBER OF SEQ ID NOS: 15 
; SOFTWARE: Patentln Ver. 2.0 
; SEQ ID NO 14 
LENGTH: 396 



; TYPE: PRT 

; ORGANISM: Mus musculus 
; FEATURE : 

OTHER INFORMATION: ymkz5-Fc fusion protein 
US-10-193-616-14 

Query Match 77.7%; Score 1276.5; DB 14; Length 396; 

Best Local Similarity 82.4%; Pred. No. 1.3e-95; 

Matches 244; Conservative 7; Mismatches 24; Indels 21; Gaps 5 

Qy 27 MAGQCSQNE YFD-SLLHACIPCQLRCSSNTPPL TCQRYCNASVT 69 

I II hi : | | | : | || | | : : | | : 

Db 102 MVADCSATSDRKCECQIGLYYYDPKFPESCRPC-TKCPQGIPVLQECNSTANTVCSSSVS 160 

Qy 7 0 NSV KGVDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVWDVSHEDP 12 6 

1 : I I I I M I I I I I I I M M I I II I I I I I I I 1 I I I I I I | | | | | | | | | M II I II I 

161 NAAAEPKSCDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVWDVSHEDP 220 

QY 127 EVKFNWYVDGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAP 18 6 

I I I I 1 I I I I M II I I I I I I I M I II II I I II I II I I I I I I I I I I I II I M I | | | | || | | | 

Db 221 EVKFNWYVDGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAP 2 80 

Qy 187 IEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNY 246 

I I I I I I I I I I I I I I I I I I I I I I II I I I II I I I II I I I I I I I I II I I I I I I I I || | | | | | | 
Db 281 IEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNY 340 

Qy 247 KTT PPVLDSDGS FFL YSKLTVDKS RWQQGNVFSCSVMHEALHNHYTQKS LSLS PGK 302 

M I I I M M I I I I II I I I I II I II I I 1 I II I II I I I I I I I I I II I II I 1 I | | | | | | 

Db 341 KTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKS LSLS PGK 396 



RESULT 11 
US-10-152-363A-62 

; Sequence 62, Application US/10152363A 

; Publication No. US20030103986A1 

; GENERAL INFORMATION: 

; APPLICANT: Rixon, Mark W. 

; APPLICANT: Gross, Jane A. 

; TITLE OF INVENTION: TACI-Immunoglobulin Fusion Proteins 
; FILE REFERENCE: 01-2 0 

; CURRENT APPLICATION NUMBER: US/ 1 0/ 152 , 3 63A 

CURRENT FILING DATE: 2002-05-20 
; PRIOR APPLICATION NUMBER: 60/293,343 
; PRIOR FILING DATE: 2001-05-24 

NUMBER OF SEQ ID NOS : 7 0 

SOFTWARE: Fast SEQ for Windows Version 3.0 
; SEQ ID NO 62 
; LENGTH: 332 
TYPE: PRT 

ORGANISM: Artificial Sequence 
; FEATURE : 

OTHER INFORMATION: Fusion protein. 
US-10-152-363A-62 

Query Match 77.4%; Score 1271; DB 14; Length 332; 

Best Local Similarity 72.2%; Pred. No. 2.9e-95; 

Matches 239; Conservative 15; Mismatches 37; Indels 40; Gaps 2; 



Qy 9 WVLLLWVPGSTGDVTMLQMAGQCSQNEYFDSLLHACIPCQLRCSSNTP 56 

I : I : I : I : I I : : I : I I I | : I : | : : 

Db 5 WIFLFLLSGTAG VLSAMRSCPEEQYWDPLLGTCMSCKTICNHQSQRTCAAFCRSLSC 61 

Qy 57 PLTCQRYCNASVTNSVKGVDKTHTCPPCPAPELLG 91 

I I : I : : I I II I II I I I I I I I I 
Db 62 RKEQGKFYDHLLRDCI SCAS I CGQHPKQCAYFCENKLRSEPKS SDKTHTCPPCPAPEAEG 121 

Qy 92 GPSVFLFPPKPKDTLMISRTPEVTCVWDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQY 151 

I I M I I I I I II I I I I I I I I II I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 122 APSVFLFPPKPKDTLMISRTPEVTCVWDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQY 181 

Qy 152 NSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRD 211 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 182 NSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPSSIEKTISKAKGQPREPQVYTLPPSRD 241 

Qy 212 ELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSR 271 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I 
Db 242 ELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSR 301 

Qy 272 WQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 302 

I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I 
Db 302 WQQGNVFS CSVMHEALHNHYTQKS LSLS PGK 332 



RESULT 12 
US-09-792-200B-10 

Sequence 10, Application US/09792200B 
Patent No. US20020042368A1 
GENERAL INFORMATION: 
APPLICANT: Immunex Corporation 
APPLICANT: Fanslow, William C. 
APPLICANT: Poindexter, Kurt 
APPLICANT: Cerretti, Douglas P. 
APPLICANT: Black, Roy A. 

TITLE OF INVENTION: INTEGRIN ANTAGONISTS 
FILE REFERENCE: 2958-A 

CURRENT APPLICATION NUMBER: US/ 09/ 7 92 , 2 00B 
CURRENT FILING DATE: 2001-02-23 
PRIOR APPLICATION NUMBER: US 60/184,865 
PRIOR FILING DATE: 2000-02-25 
NUMBER OF SEQ ID NOS : 22 
SOFTWARE: Patentln version 3.1 
SEQ ID NO 10 
LENGTH: 44 6 
TYPE: PRT 

ORGANISM: Artificial Sequence 
FEATURE : 

OTHER INFORMATION: Fusion Construct 
US-09-792-200B-10 



Query Match 77.2%; Score 1269/ DB 9; Length 446; 

Best Local Similarity 57.2%; Pred. No. 6e-95; 

Matches 261; Conservative 2; Mismatches 29; Indels 164; Gaps 5; 



Qy 



1 METDTLLLWVLLLWVPGSTG- 



DVTMLQMAGQC 31 



Db 1 METDTLLLWVLLLWVPGSTGTSCGNSRVDEGEECDPGIMYLNNDTCCNSDCT-LKEGVQC 59 

Qy 32 SQNEYFDSLLHACIPCQLRCSSNTPPLTCQRYCNAS 67 

I I I I I Mil: 

Db 60 SDRN SPCCKNCQFETAQKKCQEAINATCKGVSYCTGNSSECPPPGNAEDDT 110 

Qy 68 VTNSVK 73 

I I I 

Db 111 VCLDLGKCKDGKCIPFCEREQQLESCACNETDNSCKVCCRDLSGRCVPYVDAEQKNLFLR 170 

Qy 74 GVDKTHTCPPCPA 8 6 

I I I II I I I II I 

Db 171 KGKPCTVGFCDMNGKCEKRVQDVIERFWDFIDQLSINTFGKFLADNRSCDKTHTCPPCPA 230 

Qy 87 PELLGGPSVFLFPPKPKDTLMISRTPEVTCVWDVSHEDPEVKFNWYVDGVEVHNAKTKP 14 6 

II I I I I I I I I I I I I I I I II I I I II I I I I I II I I 1 I I I I I I I I II I I I I I II I I II I I 

Db 231 P EAEGAP S VFL FP P K P KDT LMI S RT P E VT CVWD VS H E D P E VKFNW YVDGVEVHNAKT K P 290 

Qy 147 REEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTL 206 

I I I I I I I I I I I I I I I I I I I I I I I I I I II I II I I I I I I I I II I I I I I I I I I I I I I I I I I I I 
Db 291 REEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTL 350 

Qy 2 07 PPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLT 2 66 

I M I II I II I I I I I I I I I I I I I I I II I I I I I I II I I I I I I I I I I I I II I I I I I M I I I I I 
Db 351 PPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLT 410 

Qy 267 VDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 302 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 411 VDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 446 



RESULT 13 
US-09-792-200B-6 

Sequence 6, Application US/09792200B 
Patent No. US2 002 0042368A1 
GENERAL INFORMATION: 
APPLICANT: Immunex Corporation 
APPLICANT: Fanslow, William C. 
APPLICANT: Poindexter, Kurt 
APPLICANT: Cerretti, Douglas P. 
APPLICANT: Black, Roy A. 

TITLE OF INVENTION: INTEGRIN ANTAGONISTS 
FILE REFERENCE: 2958-A 

CURRENT APPLICATION NUMBER: US/ 0 9/ 7 92 , 2 00B 
CURRENT FILING DATE: 2001-02-23 
PRIOR APPLICATION NUMBER: US 60/184,865 
PRIOR FILING DATE: 2000-02-25 
NUMBER OF SEQ ID NOS: 22 
SOFTWARE: Patentln version 3.1 
SEQ ID NO 6 
LENGTH: 4 65 
TYPE: PRT 

ORGANISM: Artificial Sequence 
FEATURE: 

OTHER INFORMATION: Fusion Construct 
US-09-792-200B-6 



Query Match 77.0%; Score 1265.5; DB 9; Length 465; 

Best Local Similarity 56.1%; Pred. No. 1.2e-94; 

Matches 261; Conservative 11; Mismatches 30; Indels 163; Gaps 7 

Qy 1 METDTLLLWVLLLWVPGSTG DVTM L 25 

I I I I I I I I I I I I I I I I I I II | | 

Db 1 METDTLLLWVLLLWVPGSTGTSCGNGMVEQGEECDCGYSDQCKDECCFDANQPEGRKCKL 60 

Qy 2 6 QMAGQCSQNE- YFDSLLHACIP 4 6 

: III— I : I I I 

Db 61 KPGKQCSPSQGPCCTAQCAFKSKSEKCRDDSDCAREGICNGFTALCPASDPKPNFTDCNR 12 0 

Qy 47 CQLRCSSNTPPLTC 60 

1:1 III 

Db 121 HTQVCINGQCAGSICEKYGLEECTCASSDGKDDKELCHVCCMKKMDPSTCASTGSVQWSR 18 0 

QY 61 QRYCN ASVTNSV KGVDK 7 7 

: 11= I : :: : || 

Db 181 HFSGRTITLQPGSPCNDFRGYCDVFMRCRLVDADGPLARLKKAIFSPELYENIAERSCDK 240 

Qy 78 THTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVWDVSHEDPEVKFNWYVDGV 137 

I I I I M I I I I I I I I I I I II I 1 I I I I I I I I | || | | || | | | || || | || | | | | | | | | || | 
Db 241 THTCPPCPAPEAEGAPSVFLFPPKPKDTLMISRTPEVTCVWDVSHEDPEVKFNWYVDGV 300 

Qy 138 EVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQ 197 

I M I I I I II I I I I 1 I M M I I I I I I I 1 I I I II I I I I I I I I I I I II I I I I I I I I I I | | | | | 
Db 301 EVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQ 360 

Qy 198 PREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDG 257 

I I I I M I I I I M I I I I I I I I I I I I II I I I I I I I I I I II I I I I I I II I I I I I I | | | | | | | | 
Db 361 PREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDG 420 

Qy 258 SFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 302 

I I I M M I 1 I I I II I I I I I I I I II | | | | | || | | | | | | | | | | | | | | 
Db 421 SFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 465 



RESULT 14 
US-10-622-407-14 

; Sequence 14, Application US/10622407 

; Publication No. US20040018544A1 

; GENERAL INFORMATION: 

; APPLICANT: Saris, Chris 

; TITLE OF INVENTION: ISOLATION, IDENTIFICATION, AND CHARACTERIZATION OF TMST2 
A NOVEL 

; TITLE OF INVENTION: MEMBER OF THE TNF- RECEPTOR SUPERFAMILY OF GENES 

; FILE REFERENCE: 01017/35434B 

; CURRENT APPLICATION NUMBER: US/ 10/ 622 , 4 07 

; CURRENT FILING DATE: 2003-07-17 

; PRIOR APPLICATION NUMBER: US 09/612,033 

PRIOR FILING DATE: 2000-07-07 
; PRIOR APPLICATION NUMBER: US 60/143,063 
; PRIOR FILING DATE: 1999-07-09 
; NUMBER OF SEQ ID NOS : 15 
; SOFTWARE: Patentln Ver. 2.0 
; SEQ ID NO 14 



; LENGTH: 39 8 
TYPE: PRT 

ORGANISM: Artificial Sequence 
FEATURE : 

OTHER INFORMATION: Description of Artificial Sequence: Fusion protein 
; OTHER INFORMATION: consisting of Mus musculus sequences and 

OTHER INFORMATION: Immunoglobulin sequences 
US-10-622-407-14 

Query Match 77.0%; Score 1264.5; DB 15; Length 398; 

Best Local Similarity 83.4%; Pred. No. 1.2e-94; 

Matches 242; Conservative 6; Mismatches 27; Indels 15; Gaps 5 

Qy 26 QMAGQCSQNE YFD-SLLHACIPCQLRCSSNTPPL-TCQRYCNASVTNSV 72 

•I II 1 = 1 : I I I : I III I : : | | 

Db 111 EMVADCSATSDRKCQCRTGLYYYDPKFPESCRPC-TKCPQGIPVLQECNSTANTVCSSSV 169 

Qy 73 KGVDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVWDVSHEDPEVKFNW 132 

M I I II I I I I I I I I I II I I I I I II I II I II I I I I I I I I I I I I I I M I I I I I I I I I I I 
Db 170 SNVD-THTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVWDVSHEDPEVKFNW 228 

Qy 133 YVDGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTIS 192 

I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | | | || | | | | | | | | | | | | || | 
Db 229 YVDGVEVHNAKT KP REEQYN S T YRWS VLT VLHQDWLNGKE YKCKVSNKALPAP I EKT I S 288 

Qy 193 KAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPV 252 

I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2 89 KAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPV 34 8 

Qy 253 LDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 302 

I I I I I II I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 349 LDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 398 



RESULT 15 
US-10-258-368-15 

; Sequence 15, Application US/10258368 

; Publication No. US20040013674A1 

; GENERAL INFORMATION: 

; APPLICANT: Ambrose, Christine 

; APPLICANT : Thompson, Jeffrey 

; APPLICANT: Schneider, Pascal 

; APPLICANT: Rennert, Paul 

; TITLE OF INVENTION: Taci As As Anti-Tumor Agent 
; FILE REFERENCE: BIOG0130 NP of PCT/US01/40626 
; CURRENT APPLICATION NUMBER: US/10/258 , 368 
; CURRENT FILING DATE: 2002-10-21 

PRIOR APPLICATION NUMBER: USSN 60/199,946 

PRIOR FILING DATE: 2000-04-27 
; NUMBER OF SEQ ID NOS : 17 
; SOFTWARE: Patentln version 3.1 
; SEQ ID NO 15 
LENGTH: 404 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-10-258-368-15 



Query Match 77.0%; Score 1264.5; DB 15; Length 404; 

Best Local Similarity 77.7%; Pred. No. 1.2e-94; 

Matches 244; Conservative 6; Mismatches 19; Indels 45; Gaps 



3; 



Qy 33 QNEYFDSLLHACIPC QLRCSSNTPPLTCQRYCNAS VTN 7 0 

I : : : I I i II I : I I 111 : I : II 

Db 92 QGKFYDHLLRDCISCASICGQHPKQCAYFCENKLRSPVNLPP-ELRRQRSGEVENNSDNS 150 

Qy 71 SVKGVDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMI 108 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 151 GRYQGLEHRGSEAS PALPGLKLSADQVDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMI 210 

Qy 109 SRTPEVTCVWDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDW 168 

I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 211 SRTPEVTCVWDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDW 27 0 

Qy 169 LNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFY 22 8 

I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 271 LNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFY 330 

Qy 229 PSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALH 288 

I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 331 PSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALH 390 

Qy 289 NHYTQKSLSLSPGK 302 

I I I I I I I II I I I I I 
Db 391 NHYTQKSLSLSPGK 404 



Search completed: April 19, 2004, 13:25:48 
Job time : 48.2263 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 

R Un on: April 19, 2004, 13:17:32 ; Search time 44.7407 Seconds 

(without alignments) 
2129.748 Million cell updates/sec 



Title: 

Perfect score: 
Sequence : 



PCT-US03-05147-3 
1643 

1 METDTLLLWVLLLWVPGSTG. 



. MHEALHNHYTQKSLSLS PGK 302 



Scoring table : 



BLOSUM62 

Gapop 10.0 , Gapext 0.5 



Searched: 1017041 seqs, 315518202 residues 

Total number of hits satisfying chosen parameters: 1017041 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database : SPTREMBL 25:* 



1: 




sp archea:* 


2: 




sp bacteria:* 


3: 




sp fungi:* 


4 




sp human:* 


5 




sp invertebrate:* 


6 




sp mammal : * 


7 




sp mhc:* 


8 




sp organelle:* 


9 




sp_phage : * 


lO- 


sp plant:* 


ll 


sp rodent:* 


12 


sp virus : * 


13 


sp vertebrate:* 


14 


sp unclassified:* 


15 


; sp rvirus : * 


16 


: sp bacteriap:* 


17 


: sp archeap:* 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 



1 


1243 


75. 


7 


469 


4 


Q7Z7P5 


Q7z7p5 homo 


sapien 


2 


1243 


75. 


7 


470 


4 


Q7Z5W1 


Q7z5wl homo 


sapien 


3 


1239 


75. 


4 


482 


4 


Q7Z351 


Q7z351 homo 


sapien 


4 


1237 


75. 


3 


679 


4 


Q96PQ8 


Q96pq8 homo 


sapien 


5 


1163 


70. 


8 


354 


4 


Q86TT2 


Q86tt2 homo 


sapien 


6 


1159 


70. 


5 


521 


4 


Q8N4Y9 


Q8n4y9 homo 


sapien 


7 


1148 


69. 


9 


509 


4 


Q8NF17 


Q8nfl7 homo 


sapien 


8 


1142 


69. 


5 


473 


4 


Q8TC63 


Q8tc63 homo 


sapien 


9 


917.5 


55. 


8 


337 


6 


Q95M34 


Q95m34 equus 


cabal 


10 


856 


52. 


1 


470 


11 


Q7TMK1 


Q7tmkl mus 


musculu 


11 


841.5 


51. 


2 


469 


11 


Q8R3V9 


Q8r3v9 mus 


musculu 


12 


837.5 


51. 


0 


463 


11 


Q99LC4 


Q991c4 mus 


musculu 


13 


833.5 


50. 


7 


437 


11 


Q9R1A4 


Q9rla4 mus 


musculu 


14 


826. 5 


50. 


3 


473 


11 


Q9D8L4 


Q9d814 mus 


musculu 


15 


815.5 


49. 


6 


468 


11 


Q99L31 


Q99131 mus 


musculu 


16 


815.5 


49. 


6 


473 


11 


Q99L25 


Q99125 mus 


musculu 


17 


790.5 


48 . 


1 


473 


11 


Q91Z05 


Q91z05 mus 


musculu 


18 


790. 5 


48. 


1 


474 


11 


Q8R3H6 


Q8r3h6 mus 


musculu 


19 


358 


21. 


8 


375 


4 


Q86TT1 


Q8 6ttl homo 


sapien 


20 


358 


21. 


8 


597 


4 


Q9BU10 


Q9bul0 homo 


sapien 


21 


358 


21. 


8 


597 


4 


Q9BQB8 


Q9bqb8 homo 


sapien 


22 


358 


21. 


8 


597 


4 


Q96BB9 


Q96bb9 homo 


sapien 


23 


349 


21. 


2 


614 


11 


Q7TMT6 


Q7tmt6 mus 


musculu 


24 


348 


21. 


2 


588 


4 


Q8WUX4 


Q8wux4 homo 


sapien 


25 


348 


21. 


2 


613 


4 


Q96EY0 


Q96ey0 homo 


sapien 


26 


348 


21. 


2 


613 


4 


Q8WUK1 


Q8wukl homo 


sapien 


27 


348 


21. 


2 


614 


4 


Q96GA6 


Q96ga6 homo 


sapien 


28 


348 


21. 


2 


618 


4 


Q9 6AA6 


Q96aa6 homo 


sapien 


29 


347 


21. 


1 


613 


11 


Q8VCX7 


Q8vcx7 mus 


musculu 


30 


316. 5 


19. 


3 


587 


13 


Q7T0R1 


Q7t0rl xenopus lae 


31 


280 


17. 


0 


384 


4 


Q9UP60 


Q9up60 homo 


sapien 


32 


280 


17. 


0 


493 


4 


Q8NCL6 


Q8ncl6 homo 


sapien 


33 


280 


17, 


0 


494 


4 


Q96K68 


Q96k68 homo 


sapien 


34 


280 


17. 


0 


496 


4 


Q96KX8 


Q96kx8 homo 


sapien 


35 


277 


16. 


9 


496 


4 


Q96DK0 


Q96dk0 homo 


sapien 


36 


277 


16 


9 


499 


4 


Q8N5K4 


Q8n5k4 homo 


sapien 


37 


275 


16 


7 


497 


4 


Q8WY24 


Q8wy24 homo 


sapien 


38 


275 


16 


7 


500 


4 


Q9BRV0 


Q9brv0 homo 


sapien 


39 


274 


16 


.1 


684 


13 


Q90544 


Q90544 ginglymosto 


40 


273.5 


16 


. 6 


478 


4 


Q7Z379 


Q7z37 9 homo 


sapien 


41 


273.5 


16 


. 6 


492 


4 


Q7Z374 


Q7z374 homo 


sapien 


42 


271.5 


16 


. 5 


416 


4 


Q9NPP6 


Q9npp6 homo 


sapien 


43 


269. 5 


16 


. 4 


486 


11 


Q91Z07 


Q91z07 mus 


musculu 


44 


269. 5 


16 


. 4 


487 


11 


Q99KA4 


Q99ka4 mus 


musculu 


45 


268.5 


16 


. 3 


426 


11 


Q9DCD9 


Q9dcd9 mus 


musculu 



ALIGNMENTS 



RESULT 1 
Q7Z7P5 

ID Q7Z7P5 PRELIMINARY; PRT ; 469 AA. 

AC Q7Z7P5; 

DT 01-OCT-2003 (TrEMBLrel. 25, Created) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 



DE Hypothetical protein. 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE-Spleen; 

RX MEDLINE=22388257; PubMed-124 77 932 ; 

RA Strausberg R.L., Feingold E.A. , Grouse L.H., Derge J.G., 

RA Klausner R.D., Collins F.S., Wagner L., Shenmen CM., Schuler G.D., 

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., 

RA Hopkins R.F., Jordan H. f Moore T., Max S.I., Wang J., Hsieh F. , 

RA Diatchenko L . , Marusina K . r Farmer A. A. , Rubin G.M., Hong L . , 

RA Stapleton M. , Soares M.B., Bonaldo M.F., Casavant T . L . , Scheetz T.E., 

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P . , Prange C, 

RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J., 

RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J. A., Gunaratne P.H., 

RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W., 

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A., 

RA Fahey J., Helton E. , Ketteman M. , Madan A., Rodrigues S., Sanchez A., 

RA Whiting M., Madan A. , Young A.C., Shevchenko Y., Bouffard G.G., 

RA Blakesley R.W. r Touchman J.W., Green E.D., Dickson M.C., 

RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M., Butterfield Y.S., 

RA Krzywinski M.I., Skalska U. f Smailus D.E., Schnerch A., Schein J.E,, 

RA Jones S.J., Marra M.A. ; 

RT "Generation and initial analysis of more than 15,000 full-length human 

RT and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci . U.S.A. 99:16899-16903(2002). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Spleen; 

RA Strausberg R. ; 

RL Submitted (APR-2003) to the EMBL/ GenBank/ DDB J databases. 

DR EMBL; BC051328; AAH51328.1; -. 

KW Hypothetical protein. 

SQ SEQUENCE 469 AA; 51395 MW; C8D5BE12BAAF7 95C CRC64; 

Query Match 75.7%; Score 1243; DB 4; Length 469; 

Best Local Similarity 92.1%; Pred. No. 1.7e-109; 

Matches 234; Conservative 1; Mismatches 11; Indels 8; Gaps 2 

CSSNTPPLTCQRYCNASVTNSV — KGVDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMI 108 
I : I I I I 1 I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 



Qy 


51 


Db 


222 


Qy 


109 


Db 


276 


Qy 


169 


Db 


336 


Qy 


229 


Db 


396 



S RT P E VT CVWDVS H E D P EVK FNW YVD GVE VHN AKT K P RE EQ YN STY RWS VL T VLH Q DW 168 

I I I I I I I I I I I I I I I II I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I 

SRTPEVTCVWDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDW 335 

LNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFY 228 

I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I i I I 

LNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFY 395 

PSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALH 2 88 

I II I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

PSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALH 4 55 



Qy 289 NHYTQKSLSLSPGK 302 

II I I I I I I I II I I I 
Db 456 NHYTQKSLSLSPGK 469 



RESULT 2 




Q7Z5W1 




ID 


Q7Z5W1 PRELIMINARY; PRT; 470 AA. 




AC 


Q7Z5W1; 




DT 


01-OCT-2003 (TrEMBLrel. 25, Created) 




DT 


01-OCT-2003 (TrEMBLrel. 25, Last sequence update) 




DT 


01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 




DE 


Hypothetical protein . 




OS 


Homo sapiens (Human) . 




oc 


Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 




oc 


Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 




ox 


NCBI TaxID=9606; 




RN 


[1] 




RP 


SEQUENCE FROM N.A. 




RC 


TISSUE=Spleen; 




RX 


MEDLINE=22388257; PubMed=12477932 ; 




RA 


Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., 




RA 


Klausner R.D., Collins F.S., Wagner L . , Shenmen CM., Schuler G.D 


* r 


RA 


Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., 




RA 


Hopkins R.F., Jordan H., Moore T . , Max S.I., Wang J., Hsieh F . , 




RA 


Diatchenko L., Marusina K., Farmer A. A. , Rubin G.M., Hong L . , 




RA 


Stapleton M. , Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T 


. E . , 


RA 


Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C. 


f 


RA 


Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S 


.J., 


RA 


Bosak S.A., McEwan P.J., McKernan K.J., Malek J. A. , Gunaratne P.H 


• r 


RA 


Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S 


.w.. 


RA 


Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A., 




RA 


Fahey J., Helton E . , Ketteman M. , Madan A., Rodrigues S., Sanchez 


A., 


RA 


Whiting M. , Madan A., Young A.C., Shevchenko Y., Bouffard G.G., 




RA 


Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C., 




RA 


Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M., Butterfield 


Y.S., 


RA 


Krzywinski M.I., Skalska U. , Smailus D.E., Schnerch A., Schein J. 


E. , 


RA 


Jones S.J., Marra M.A. ; 




RT 


"Generation and initial analysis of more than 15,000 full-length 


human 


RT 


and mouse cDNA sequences. "; 




RL 


Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002). 




RN 


[2] 




RP 


SEQUENCE FROM N.A. 




RC 


TISSUE=Spleen; 




RA 


Strausberg R. ; 




RL 


Submitted (JUN-2003) to the EMBL/ GenBank/DDBJ databases. 




DR 


EMBL; BC053984; AAH53984.1; -. 




KW 


Hypothetical protein. 




SQ 


SEQUENCE 470 AA; 51204 MW; 77 8CF3452 14 83E1A CRC64; 





Query Match 75.7%; Score 1243; DB 4; Length 470; 

Best Local Similarity 92.1%; Pred. No. 1.7e-109; 

Matches 234; Conservative 1; Mismatches 11; Indels 8; Gaps 



Qy 51 CSSNTPPLTCQRYCNASVTNSV--KGVDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMI 1 



Db 



223 CNVNHKP SNTKVDKKVEPKSCDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMI 276 



Qy 109 SRTPEVTCVWDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDW 168 

| | I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I 
Db 277 SRTPEVTCVVVDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDW 336 

Qy 169 LNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFY 228 

I || I I I I I I I I I I I I I I I I II I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I M I I I I I I 
Db 337 LNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFY 396 

Qy 229 PSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALH 288 

I I I I I I I I I II I I I I I I I I I I I I I II I I I I I II I I M I I I I I I I I I I I I II I I I I I I I I I 
Db 397 PSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALH 456 

Qy 289 NHYTQKSLSLSPGK 302 

I I I I I I I I II I I I I 
Db 457 NHYTQKSLSLSPGK 470 



RESULT 3 
Q7Z351 

ID Q7Z351 PRELIMINARY; PRT; 482 AA. 

AC Q7Z351; 

DT 01-OCT-2003 (TrEMBLrel. 25, Created) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Hypothetical protein DKFZp686N02209 . 

GN DKFZP686N02209. 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Human rectum tumor; 

RA Bloecker H., Boecher M. , Mewes H.W., Weil B., Amid C, Osanger A., 

RA Fobo G., Han M. , Wiemann S.; 

RL Submitted (JUN-2 003) to the EMBL/ GenBank/DDBJ databases. 

DR EMBL; BX538118; CAD98026.1; -. 

KW Hypothetical protein. 

SQ SEQUENCE 482 AA; 52852 MW; EDA75F1901D1A034 CRC64; 

Query Match 75.4%; Score 1239; DB 4; Length 482; 

Best Local Similarity 91.7%; Pred. No. 4.3e-109; 

Matches 233; Conservative 1; Mismatches 12; Indels 8; Gaps 2; 

Qy 51 CSSNTPPLTCQRYCNASVTNSV--KGVDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMI 108 

I : I I I I I I I I I I I I II I I I I I I I II I I I I I I I I I I I I M I I 

Db 235 CNVNHKP SNTKVDKKVEPKSCDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMI 288 

Qy 109 SRTPEVTCVWDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDW 168 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I II I I I I I I I I I I I I I I I M 

Db 289 SRTPEVTCVWDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDW 348 

Qy 169 LNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFY 228 

I I II I I II I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I M I I I 
Db 349 LNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFY 408 



Qy 229 PSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALH 288 

1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 II 

Db 409 PSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEGLH 468 

Qy 289 NHYTQKSLSLSPGK 302 

II I I II I I I I I I I I 
Db 469 NHYTQKSLSLSPGK 482 



RESULT 4 
Q96PQ8 



ID 
AC 
DT 
DT 
DT 
DE 
OS 

oc 
oc 
ox 

RN 
RP 
RX 
RA 
RT 
RT 
RL 
RN 
RP 
RA 
RL 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 



Q96PQ8 
Q96PQ8; 
01-DEC-2001 
01-JUN-2003 
01-OCT-2003 



PRELIMINARY; 



PRT; 



679 AA. 



Created) 

Last sequence update) 
Last annotation update) 
immunocon jugate . 



Craniata; Vertebrata; Euteleostomi ; 
Catarrhini; Hominidae; Homo. 



(TrEMBLrel. 19, 
(TrEMBLrel. 24, 
(TrEMBLrel. 25, 
Factor VII active site mutant 
Homo sapiens (Human) . 
Eukaryota; Metazoa; Chordata; 
Mammalia; Eutheria; Primates; 
NCBI_TaxID=9606; 
[1] 

SEQUENCE FROM N.A. 

MEDLINE=2147744 8; PubMed=l 15 93034 ; 
Hu Z . , Garen A. ; 

"Targeting tissue factor on tumor vascular endothelial cells and tumor 
cells for immunotherapy in mouse models of prostatic cancer."; 
Proc. Natl. Acad. Sci. U.S.A. 98:1218 0-12185(2001). 
[2] 

SEQUENCE FROM N.A. 
Hu Z . , Garen A. ; 

Submitted (FEB-2003) to the EMBL/ GenBank/DDBJ databases. 
EMBL; AF272774; AAK58686.2; 

C: extracellular; IEA. 
F: calcium ion binding; IEA. 
F: chymotrypsin activity; IEA. 
F: trypsin activity; IEA. 
P : proteolysis and peptidolysis ; IEA. 



GO; 
GO; 
GO; 
GO; 
GO; 



GO: 0005576; 
GO: 0005509; 
GO: 0004263; 
GO: 0004295; 
GO: 0006508; 
InterPro 
InterPro 
InterPro 
InterPro 
InterPro 
InterPro 
InterPro 
InterPro 
InterPro 
InterPro 
InterPro 
InterPro 
InterPro 
InterPro 



IPR000152; 
IPR009003; 
IPR000742; 
IPR001881; 
IPR001438; 
IPR006209; 
IPR002383; 
IPR006210; 
IPR007110; 
IPR003597; 
IPR003006; 
IPR001254; 
IPR001314; 
IPR000294; 
Pfam; PF00008; EGF; 2. 
Pfam; PF00594; gla; 1. 
Pfam; PF00047; ig; 2. 
Pfam; PF00089; trypsin; 1 



Asx_hydroxyl_S . 
Cys_Ser_trypsin . 
EGF_2 . 
EGF_Ca. 
EGF_II . 
EGF_like. 
GLA_blood. 
IEGF. 
Ig-like . 
Ig_cl. 
Ig_MHC. 
Peptidase_Sl . 
Peptidase_SlA. 
VitK_dep_GLA. 



DR PRINTS; PR00722; CHYMOTRYPSIN . 

DR PRINTS; PR00010; EGFBLOOD. 

DR PRINTS; PR00001; GLABLOOD . 

DR SMART; SM00181; EGF; 2. 

DR SMART; SM0 017 9; EGF_CA; 1. 

DR SMART; SM00069; GLA; 1. 

DR SMART; SM00407; IGcl; 2. 

DR SMART; SM00020; Tryp_SPc; 1. 

DR PROSITE; PS00010; ASX_HYDROXYL ; 1. 

DR PROSITE; PS00022; EGF_1; 1. 

DR PROSITE; PS01186; EGF_2 ; 1. 

DR PROSITE; PS01187; EGF_CA; 1. 

DR PROSITE; PS00011; GLU_CARBOX YLAT I ON ; 1. 

DR PROSITE; PS50835; IG_LIKE; 2. 

DR PROSITE; PS00290; IG_MHC; 1. 

DR PROSITE; PS50240; TRYPSIN_DOM; 

DR PROSITE; PS00134; TRYPSIN_HIS; 

DR PROSITE; PS00135; TRYPSIN_SER; 

SQ SEQUENCE 679 AA; 75552 MW; 



1. 
1. 
1. 

0B0023AE70A067A1 CRC64; 



Query Match 75.3%; Score 1237; DB 4; Length 679; 

Best Local Similarity 99.1%; Pred. No. l.le-108; 

Matches 22 8; Conservative 0; Mismatches 2; Indels 0; 



Gaps 



0; 



QY 



Db 



73 KGVDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVWDVSHEDPEVKFNW 132 
I I I I I II I I I I I I I II I I I M I I M I I I I I I I I I I I I 1 I I I I I I I I I 1 I I I I I I I M I 
450 KSCDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCWVDVSHEDPEVKFNW 509 



QY 
Db 

Qy 

Db 

Qy 

Db 



133 YVDGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTIS 192 

I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I M I I I I I I 
510 YVDGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTIS 569 

193 KAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPV 252 

M I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I 
57 0 KAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPPV 629 

253 LDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 302 

I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I II I I I M I M I I I I I I I I 
630 LDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 679 



RESULT 5 
Q86TT2 

ID Q86TT2 PRELIMINARY; PRT; 354 AA. 

AC Q8 6TT2; 

DT 01-JUN-2003 (TrEMBLrel. 24, Created) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Human full-length cDNA clone CS0DI019YF20 of placenta of Homo sapiens 

DE (Human) (Fragment) . 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N . A. 

RC TISSUE=Placenta; 



RA Genoscope; 

RL Submitted (FEB-2 003) to the EMBL/ GenBank/DDBJ databases. 

RN [2] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Placenta; 

RA Li W.B., Gruber C, Jessee J., Polayes D . ; 

RT "Full-length cDNA libraries and normalization."; 

RL Submitted (FEB-2003) to the EMBL/ GenBank/DDBJ databases. 

DR EMBL; BX248278; CAD62606.1; 

DR GO; GO: 0046821; C : extrachromosomal DNA; IEA. 

DR InterPro; IPR007110; Ig-like. 

DR InterPro; IPR003597; Ig_cl. 

DR InterPro; IPR003006; Ig_MHC. 

DR Pfam; PF00047; ig; 1. 

DR SMART; SM004 07; IGcl; 3. 

DR PROSITE; PS50835; IG_LIKE; 3. 

DR PROSITE; PS00290; IG_MHC; 2. 

KW Plasmid. 

FT NONJTER 1 1 

SQ SEQUENCE 354 AA; 39125 MW; 23B8 0BF4D2B8 7A92 CRC64 ; 

Query Match 70.8%; Score 1163; DB 4; Length 354; 

Best Local Similarity 84.7%; Pred. No. 4.8e-102; 

Matches 221; Conservative 9; Mismatches 19; Indels 12; Gaps 4; 

Qy 46 PCQLRC SSNTPPLTCQRYCNASVTNSVKGVDKTHTCPPCPAPELLGGPSVFLFPPK 101 

|| M I : I II II I I I I I I I I I I I M I I I I I I I I I 

Db 102 PCP-RCPEPKSCDTPP-PCPR CPEPKSCDTPPPCPRCPAPELLGGPSVFLFPPK 153 

Qy 102 PKDTLMISRTPEVTCVWDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRWSVL 161 

I I I I 1 I I I I I I II I I I I I I I I I I I I I I : I I I II I I I I I I I I M I I I I I I I I I : II I II I 
D b 154 P KDT LMI S RT P EVT CVWDVS HE D P E VQ FKW YVDGVEVHNAKT K P RE EQ YN S T FRWSVL 213 

Qy 162 TVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT 221 

I I I 1 I I I I I I II I I I I I I I I I I I I I I I I II I I I I I I I I I I : I : I I I I I I I I 

Db 214 TVLHQDWLNGKEYKCKVSNKALPAPIEKTISKTKGQPREPQVYTLPPSREEMTKNQVSLT 273 

Qy 222 CLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCS 281 

I I I I I I I I I II I I I I II : I I I I I I I I I I I : I I I I I I I II I I I I M I I I I I I I I I : I I M 
Db 274 CLVKGFYPSDIAVEWESSGQPENNYNTTPPMLDSDGSFFLYSKLTVDKSRWQQGNIFSCS 333 

Qy 282 VMHEALHNHYTQKSLSLSPGK 302 

I I II I I I I : II I I I I I I I I I 
Db 334 VMHEALHNRFTQKSLSLSPGK 354 



RESULT 6 
Q8N4Y9 

ID Q8N4Y9 PRELIMINARY; PRT; 521 AA. 

AC Q8N4Y9; 

DT 01-OCT-2002 (TrEMBLrel. 22, Created) 

DT 01-OCT-2002 (TrEMBLrel. 22, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Hypothetical protein. 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 



OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Primary B-Cells; 

RA Strausberg R. ; 

RL Submitted (JUN-2002) to the EMBL/ GenBank/DDBJ databases. 

DR EMBL; BC033178; AAH33178.1; -. 

DR PIR; A60764; A60764. 

DR InterPro; IPR007110; Ig-like. 

DR InterPro; IPR003597; Ig_cl. 

DR InterPro; IPR003006; Ig_MHC. 

DR InterPro; IPR003596; Ig_v. 

DR Pfam; PF00047; ig; 4. 

DR SMART; SM00407; IGcl; 3. 

DR SMART; SM00406; IGv; 1. 

DR PROSITE; PS50835; IG_LIKE; 4. 

DR PROSITE; PS00290; IG_MHC; 2. 

KW Hypothetical protein. 

SQ SEQUENCE 521 AA; 57156 MW; 2AC7D22E72D6CAA2 CRC64; 

Query Match 70.5%; Score 1159; DB 4; Length 521; 

Best Local Similarity 84.3%; Pred. No. 1.9e-101; 

Matches 220; Conservative 10; Mismatches 19; Indels 12; Gaps 4; 

Qy 46 PCQLRC SSNTPPLTCQRYCNASVTNSVKGVDKTHTCPPCPAPELLGGPSVFLFPPK 101 

M II I : I I I II I I I I I I I I I I I I I I I I I I I I I I 

Db 269 PCP-RCPEPKSCDTPP-PCPR CPEPKSCDTPPPCPRCPAPELLGGPSVFLFPPK 320 

Qy 102 P KD T LM I S RT P EVT CVWDVS H E D P E VK FNW YVD GVEVHN AKT K P RE E Q YN S T Y R WS VL 161 

I I I I II II II I II I I II I I I I M I II I : I I I I I I I I I I I I I I I I I I I I : I I I : I I I I I I 
Db 321 PKDTLMISRTPEVTCWVDVSHEDPEVQFKWYVDGVEVHNAKTKPREEQFNSTFRWSVL 380 

Qy 162 TVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQWTLPPSRDELTKNQVSLT 221 

I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I : I M I I I I I 
Db 381 TVLHQDWLNGKEYKCKVSNKALPAPIEKTISKTKGQPREPQVYTLPPSREEMTKNQVSLT 44 0 

Qy 222 CLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCS 281 

I I I I I I II I I I I I I I I I : I I I I I I I I I I I : I I I I I I I I I I I I I I II I I I I I I I I : I I I I 
Db 441 CLVKGFYPSDIAVEWESSGQPENNYNTTPPMLDSDGSFFLYSKLTVDKSRWQQGNIFSCS 500 

Qy 282 VMHEALHNHYTQKSLSLSPGK 302 

I I I I I I II : I I II I I I I I I I 
Db 501 VMHEALHNRFTQKSLSLSPGK 521 



RESULT 7 
Q8NF17 

ID Q8NF17 PRELIMINARY; PRT; 509 AA. 

AC Q8NF17; 

DT 01-OCT-2002 (TrEMBLrel. 22, Created) 

DT 01-OCT-2002 (TrEMBLrel. 22, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE FLJ00385 protein (Fragment) . 

GN FLJ00385. 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 



OX NCBI_TaxID=9 60 6; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Spleen; 

RA Jikuya H., Takano J., Kikuno R., Nagase T . , Ohara 0.; 

RT "The nucleotide sequence of a long cDNA clone isolated from human 

RT spleen."; 

RL Submitted (JUL-2 002) to the EMBL/GenBank/DDB J databases. 

DR EMBL; AK090464; BAC03445.1; -. 

DR PIR; A45874; A45874. 

DR InterPro; IPR007110; Ig-like. 

DR InterPro; IPR003597; Ig_cl. 

DR InterPro; IPR003006; Ig_MHC. 

DR Pfam; PF00047; ig; 3. 

DR SMART; SM00407; I Gel; 3. 

DR PROSITE; PS50835; IG_LIKE; 3. 

DR PROSITE; PS00290; IG_jMHC; 2. 

FT N0N__TER 1 1 

SQ SEQUENCE 509 AA; 56111 MW; 089498D807 6E8 63C CRC64; 

Query Match 69.9%; Score 1148; DB 4; Length 509; 

Best Local Similarity 84.2%; Pred. No. 2.1e-100; 

Matches 218; Conservative 10; Mismatches 19; Indels 12; Gaps 4; 

Qy 4 6 PCQLRC SSNTPPLTCQRYCNASVTNSVKGVDKTHTCPPCPAPELLGGPSVFLFPPK 101 

|| || I : I I I || I I I I I I I II I I I I I I I I I I I I I 

Db 188 PCP-RCPEPKSCDTPP-PCPR CPEPKSCDTPPPCPRCPAPELLGGPSVFLFPPK 239 

Qy 102 PKDTLMISRTPEVTCVWDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRWSVL 161 

I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I M I : I II : I I I I I I 
Db 24 0 PKDTLMISRTPEVTCVWDVSHEDPEVQFKWYVDGVEVHNAKTKPREEQFNSTFRWSVL 2 99 

Qy 162 TVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT 221 

I I II I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I : I I I I I I M 
Db 300 TVLHQDWLNGKEYKCKVSNKALPAPIEKTISKTKGQPREPQVYTLPPSREEMTKNQVSLT 359 

Qy 222 CLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCS 281 

I I I I I I I I I I I I I I I II : I I I I I II I I I I : II I I I I I I II I I I I II I I I I I I I I : I I I I 
Db 360 CLVKGFYPSDIAVEWESSGQPENNYNTTPPMLDSDGSFFLYSKLTVDKSRWQQGNIFSCS 419 

Qy 282 VMHEALHNHYTQKSLSLSP 300 

I I II I I I I : I I I I I I I I I 
Db 420 VMHEALHNRFTQKSLSLSP 438 



RESULT 8 
Q8TC63 

ID Q8TC63 PRELIMINARY; PRT; 473 AA. 

AC Q8TC63; 

DT 01-JUN-2002 (TrEMBLrel. 21, Created) 

DT 01-JUN-2002 (TrEMBLrel. 21, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Hypothetical protein. 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
OX NCBI TaxID-9606; 



RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Kidney; 

RA Strausberg R. ; 

RL Submitted (MAR-2002) to the EMBL/GenBank/DDB J databases. 

DR EMBL; BC025985; AAH25985.1; -. 

DR GO; GO: 0005507; F: copper ion binding; IEA. 

DR GO; GO: 0005489; F:eiectron transporter activity; IEA. 

DR GO; GO: 0006118; P:electron transport; IEA. 

DR InterPro; IPR000923; BlueCu_l . 

DR InterPro; IPR007110; Ig-like. 

DR InterPro; IPR003006; Ig_MHC. 

DR InterPro; IPR003596; Ig_v. 

DR Pfam; PF00047; ig; 4 . 

DR SMART; SM004 06; IGv; 1. 

DR PROSITE; PS00196; COPPER_BLUE; 1. 

DR PROSITE; PS50835; IG_LIKE; 4. 

DR PROSITE; PS00290; IG_MHC; 3. 

KW Hypothetical protein. 

SQ SEQUENCE 473 AA; 51986 MW; E2 9 92 0BO9BA369F5 CRC64; 

Query Match 69.5%; Score 1142; DB 4; Length 473; 

Best Local Similarity 85.4%; Pred. No. 7e-100; 

Matches 216; Conservative 10; Mismatches 25; Indels 2; Gaps 1; 

Qy 52 SSNTPPLTCQRYCNASVTNSVKGVDKTH — TCPPCPAPELLGGPSVFLFPPKPKDTLMI S 109 

I I II II II: : I I I I I I I I I I I II I I I I I I I I I I II I I 

Db 221 SLGTKTYTCNVDHKPSNTKVDKRVESKYGPPCPSCPAPEFLGGPSVFLFPPKPKDTLMIS 280 

Qy 110 RTPEVTCVWDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWL 169 

II I I II II I I I I I II I I I : I I I I I I I I I I I I I I I I I I I I I : I I II I I I I I I I I I I I I II 

Db 2 81 RTPEVTCVWDVSQEDPEVQFNWYVDGVEVHNAKTKPREEQFNSTYRWSVLTVLHQDWL 34 0 

Qy 170 NGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYP 229 

I I I I I I I I I I I I II: I I I I I I I I I I I I I I I I I I I I I I I :: I : I I I I I ) I I I I I M I I I 
Db 341 NGKEYKCKVSNKGLPSSIEKTISKAKGQPREPQVYTLPPSQEEMTKNQVSLTCLVKGFYP 400 

Qy 230 SDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHN 289 

I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I : I I II I I I I I : I I I I I I I I I I I I I I I 
Db 401 SDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSRLTVDKSRWQEGNVFSCSVMHEALHN 460 

Qy 290 HYTQKSLSLSPGK 302 

I I I I II I I I I II 
Db 461 HYTQKSLSLSLGK 473 



RESULT 9 
Q95M34 

ID Q95M34 PRELIMINARY; PRT; 337 AA. 

AC Q95M34; 

DT 01-DEC-2001 (TrEMBLrel. 19, Created) 

DT 01-DEC-2001 (TrEMBLrel. 19, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Immunogobulin gamma 1 heavy chain constant region (Fragment) . 

GN IGHC1. 

OS Equus caballus (Horse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 



OC Mammalia; Eutheria; Perissodactyla; Equidae; Equus . 

OX NCBI_TaxID=9796; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Wagner B. ; 

RL Submitted (DEC-2000) to the EMBL/ GenBank/DDBJ databases. 

RN [2] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=98383416; PubMed-97 17 67 1 ; 

RA Wagner B., Overesch G . , Sheoran A. , Holmes M. , Richards C, 

RA Leibold W. , Radbruch A.; 

RT "Organization of the equine immunoglobulin heavy chain constant region 

RT genes. III. Alignment of c-mu, c-gamma, c-epsilon and c-alpha genes."; 

RL Immunobiology 199:105-119(1998). 

DR EMBL; AJ300675; CAC44624.1; 

DR InterPro; IPR007110; Ig-like. 

DR InterPro; IPR003597; Ig_cl. 

DR InterPro; IPR003006; Ig_MHC. 

DR Pfam; PF00047; ig; 2. 

DR SMART; SM00407; IGcl; 2. 

DR PROSITE; PS50835; IG_LIKE; 3. 

DR PROSITE; PS00290; IG_MHC; 2. 

FT NONJTER 1 1 

SQ SEQUENCE 337 AA; 37438 MW; A60BF2B01DEFD1F6 CRC64; 

Query Match 55.8%; Score 917.5; DB 6; Length 337; 

Best Local Similarity 58.7%; Pred. No. 9.6e-79; 

Matches 172; Conservative 42; Mismatches 54; Indels 25; Gaps 



II II : | ::: | | :: I 



Qy 


12 


Db 


68 


Qy 


72 


Db 


113 


Qy 


132 


Db 


165 


Qy 


192 


Db 


225 


Qy 


250 


Db 


285 



VKGVDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVWDVSHEDPEVKFN 131 

II I I I I I I I I I I I I I : I I I I I I I I I I : I I I I I I I I I I I I I I : I : I I I I 
KCPKCPAPELLGGPSVFIFPPNPKDTLMITRTPEVTCWVDVSQENPDVKFN 164 



WYVDGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTI 191 

11:11111 I I : I : I I I : I I I I I I I I I I : I I I I I : I I I : I I I I : I : I I I I I I : I I 
WYMDGVEVRTATTRPKEEQFNSTYRWSVLRIQHQDWLSGKEFKCKVNNQALPQPIERTI 224 

SKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQP — ENNYKTT 24 9 

: I ||: : I I I I I I I I I I : I : : I I : I I I I I III : I : I I : I I I I I I I I I 
TKTKGRSQEPQVYVLAPHPDELSKSKVSVTCLVKDFYPPEINIEWQSNGQPELETKYSTT 284 

PPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 302 

I I I I I : I I I I I I : I I : : I I I I I hi I I II I I I I I I I I I : : I MM 
QAQQDSDGSYFLYSKLSVDRNRWQQGTTFTCGVMHEALHNHYTQKNVSKNPGK 337 



RESULT 10 
Q7TMK1 

ID Q7TMK1 PRELIMINARY; PRT; 470 AA. 

AC Q7TMK1; 

DT 01-OCT-2003 (TrEMBLrel. 25, Created) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 



DE Hypothetical protein. 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=CZECH II; TI SSUE=Breas t tumor; 

RX MEDLINE=22 38 8257; PubMed=124 77 932 ; 

RA Strausberg R.L., Feingold E.A. , Grouse L.H., Derge J.G., 

RA Klausner R.D., Collins F.S., Wagner L. , Shenmen CM., Schuler G.D., 

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., 

RA Hopkins R.F., Jordan H. , Moore T . , Max S.I., Wang J., Hsieh F., 

RA Diatchenko L., Marusina K. , Farmer A. A. , Rubin G.M., Hong L., 

RA Stapleton M. , Soares M. B . , Bonaldo M.F., Casavant T.L., Scheetz T.E., 

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P . , Prange C, 

RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J., 

RA Bosak S.A. , McEwan P. J., McKernan K.J., Malek J. A. , Gunaratne P.H., 

RA Richards S., Worley K.C., Hale S., Garcia A.M. , Gay L.J., Hulyk S.W., 

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A. , 

RA Fahey J., Helton E., Ketteman M. , Madan A., Rodrigues S., Sanchez A., 

RA Whiting M. , Madan A., Young A.C., Shevchenko Y. , Bouffard G.G., 

RA Blakesley R.W., Touchman J.W., Green E . D. , Dickson M.C., 

RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M., Butterfield Y.S., 

RA Krzywinski M.I., Skalska U., Smailus D.E., Schnerch A., Schein J.E., 

RA Jones S.J., Marra M.A.; 

RT "Generation and initial analysis of more than 15,000 full-length human 

RT and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=CZECH II; TISSUE=Breast tumor; 

RA Strausberg R. ; 

RL Submitted (AUG-2003) to the EMBL/ GenBank/DDBJ databases. 

DR EMBL; BC055910; AAH55910.1; 

KW Hypothetical protein. 

SQ SEQUENCE 470 AA; 51727 MW; 6D9 0E4DF89 6BB0 90 CRC64 ; 

Query Match 52.1%; Score 856; DB 11; Length 470; 

Best Local Similarity 61.7%; Pred. No. le-72; 

Matches 161; Conservative 37; Mismatches 51; Indels 12; Gaps 4 

SSNTPPLTCQRYCN ASVTN S VKGVD KTHTCP — PCPAPELLGGPSVFLFPPK 101 

M I I II III : I I II II : I I I II I I :l I I I 

SSTWPSQTV — ICNVAHPASKTELIKRIEPRIPKPSTPPGSSCPPGNILGGPSVFIFPPK 2 69 

PKDTLMISRTPEVTCVWDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRWSVL 161 

Ml I I I I I I : I I I I I I I I I : I I : I : I : I I III I hill I I I I I : I I I I I 
PKDALMISLTPKVTCVWDVSEDDPDVHVSWFVDNKEVHTAWTQPREAQYNSTFRVVSAL 32 9 

TVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT 221 

: ||||: I II : I I I I : I I I I I I I I I : I I I I I I : : I I I I I : I I I : : : : I HIM 
PIQHQDWMRGKEFKCKVNNKALPAPIERTISKPKGRAQTPQVYTIPPPREQMSKKKVSLT 38 9 

CLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCS 2 81 

III I : I : I I II I I : I : I I I I I : I I I I I : : I I I I I I I M | | I : I : I I 
CLVTNFFSEAI SVEWERNGELEQDYKNTPPILDSDGTYFLYSKLTVDTDSWLQGEIFTCS 44 9 



Qy 


52 


Db 


212 


Qy 


102 


Db 


270 


Qy 


162 


Db 


330 


Qy 


222 


Db 


390 



Qy 



Db 



282 VMHEALHNHYTQKSLSLSPGK 302 

I : I I I I I I I : I I I : I I I I I I 
450 WHEALHNHHTQKNLSRSPGK 470 



AC 
DT 
DT 
DT 
DE 
GN 
OS 

oc 
oc 
ox 

RN 
RP 
RA 
RL 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
KW 
SQ 



PRELIMINARY; 

(TrEMBLrel . 
(TrEMBLrel. 
(TrEMBLrel . 



PRT; 



469 AA. 



RESULT 11 
Q8R3V9 
ID Q8R3V9 
Q8R3V9; 
01-JUN-2002 
01-JUN-2002 
01-OCT-2003 
Hypothetical protein. 
IGH-4 . 

Mus musculus (Mouse) . 
Eukaryota; Metazoa; Chordata; 
Mammalia; Eutheria; Rodentia; 
NCBI_TaxID=10090; 
[1] 

SEQUENCE FROM N.A. 
Strausberg R. ; 

Submitted (MAR-2002) to the EMBL/ GenBank/DDBJ databases. 
EMBL; BC024405; AAH24405.1; -. 
PIR; B45837; B45837. 
MGD; MGI: 96446; Igh-4. 
InterPro; IPR007110; Ig-like. 

Ig_MHC. 
Ig v. 



21, Created) 

21, Last sequence update) 
25, Last annotation update) 



Craniata; Vertebrata; Euteleostomi ; 
Sciurognathi; Muridae; Murinae; Mus, 



InterPro; IPR003006; 
InterPro; IPR003596; 
Pfam; PF00047; ig; 3. 
SMART; SM00406; IGv; 1. 
PROSITE; PS50835; IG_LIKE; 4. 
PROSITE; PS00290; IG_MHC; 1. 
Hypothetical protein. 
SEQUENCE 469 AA; 51976 MW; 



534793F155D05457 CRC64; 



Query Match 51.2%; Score 841.5; DB 11; Length 469; 

Best Local Similarity 51.4%; Pred. No. 2.5e-71; 

Matches 164; Conservative 45; Mismatches 71; Indels 39; 



Gaps 



Qy 


16 


Db 


158 


Qy 


52 


Db 


216 


Qy 


104 


Db 


271 


Qy 


164 


Db 


331 



PGSTGDVTMLQMAGQCSQNEYF- 
III : II II 



-DSLLHACI PCQLRC- 
I : I I I : 



51 



-SSNTPPLTCQRYCNASVTNSVKGVDKTHT CPP — CPAPELLGGPSVFLFPPKPK 103 

II I I II: I III II I II: I I I : I I I I I I 

PSSTWPSQTVT — CNVAHPASSTKVDKKIVPRDCGCKPCICTVPEV SSVFIFPPKPK 270 

DTLMISRTPEVTCVWDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRWSVLTV 163 

I I I : I I : I I I I I I I : I : I I I I : I : I : I I I I I I I : II I I I I I : I I I : I II I : 
DVLTITLTPKVTCVVVDISKDDPEVQFSWFVDDVEVHTAQTKPREEQFNSTFRSVSELPI 330 

LHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCL 22 3 
: I I I I I I I I I : I I : I : : I I I I I I I I I I I I I : I : I I I I I : I I : : : : I : : I I I I I : 



Qy 



224 VKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVM 2 83 



: I : I I I I I I : I I I I I II I I : : I : I I I : I : I I M I I i I : II I : I I I : 
Db 391 ITDFFPEDITVEWQWNGQPAENYKNTQPIMDTDGSYFVYSKLNVQKSNWEAGNTFTCSVL 450 

Qy 284 HEALHNHYTQKSLSLSPGK 302 

II I I I I : I : I II I I I I I 
Db 451 HEGLHNHHTEKSLSHSPGK 469 



RESULT 12 
Q99LC4 

ID Q99LC4 PRELIMINARY; PRT; 463 AA. 

AC Q99LC4; 

DT 01-JUN-2001 (TrEMBLrel. 17, Created) 

DT 01-JUN-2001 (TrEMBLrel. 17, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Similar to RIKEN cDNA 1810060009 gene. 

GN IGH-4. 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Strausberg R. ; 

RL Submitted (FEB-2001) to the EMBL/ GenBank/DDBJ databases. 

DR EMBL; BC003435; AAH03435.1; 

DR PIR; B45837; B45837. 

DR HSSP; P01842; 7 FAB . 

DR MGD; MGI: 96446; Igh-4. 

DR InterPro; IPR007110; Ig-like. 

DR InterPro; IPR003006; Ig_MHC. 

DR InterPro; IPR003596; Ig_v. 

DR Pfam; PF00047; ig; 3. 

DR SMART; SM00406; IGv; 1. 

DR PROSITE; PS50835; IG_LIKE; 4. 

DR PROSITE; PS00290; IG__MHC; 1. 

SQ SEQUENCE 463 AA; 51007 MW; EAA67 4C6BBC307 8 3 CRC64 ; 

Query Match 51.0%; Score 837.5; DB 11; Length 4 63; 

Best Local Similarity 51.1%; Pred. No. 5.8e-71; 

Matches 163; Conservative 46; Mismatches 71; Indels 39; Gaps E 

'GSTGDVTMLQMAGQCSQNEYF DSLLHACI PCQLRC 51 

II : I I II | : | | | : 



I I I I I I : I III II I Ih I I I : I I I I I I 

'SSTWPSETVT — CNVAHPASSTKVDKKIVPRDCGCKPCICTVPEV SSVFIFPPKPK 2 64 



I | I : I I : I I I I I I I : I : I II I : I : I : I I I I I I I : I : I I I I I : I I I : I I I I 



I I I I I I I I I : I I : I : : I I I I I I I I II I MM: I I I I I : I I : : : : I : : I 1 I I I : 



Qy 


16 


Db 


152 


Qy 


52 


Db 


210 


Qy 


104 


Db 


265 


Qy 


164 


Db 


325 



Qy 224 VKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVM 283 

: I : I I | | | I : I I I I I I I I I :: I : I I I : I : I I I I I I I I : I I I : I I I : 
Db 385 ITDFFPEDITVEWQWNGQPAENYKKTQPIMDTDGSYFIYSKLNVQKSNWEAGNTFTCSVL 444 

Qy 284 HEALHNHYTQKSLSLSPGK 302 

II I I I I : I : M I I I I I I 
Db 445 HEGLHNHHTEKSLSHSPGK 463 



RESULT 13 
Q9R1A4 
ID Q9R1A4 
Q9R1A4 ; 
01-MAY-2000 
01-MAY-2000 
01-OCT-2003 



PRELIMINARY; 



PRT; 



437 AA. 



AC 
DT 
DT 
DT 
DE 
GN 
OS 

oc 
oc 
ox 

RN 
RP 
RA 
RT 
RT 
RT 
RL 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
FT 
FT 
SQ 



(TrEMBLrel. 13, Created) 
(TrEMBLrel. 13, Last sequence update) 
(TrEMBLrel. 25, Last annotation update) 
Gammal heavy chain of Mab7 (Fragment) . 
IGH-4. 

Mus musculus (Mouse) . 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 
NCBI_TaxID=10 090; 
[1] 

SEQUENCE FROM N.A. 

Wilde K.G., Yu X., Ekramoddoullah A.K.M., Misra S.; 

"Cloning of cDNAs encoding for anti-white pine blister rust monoclonal 
antibody (Mab 7, its light and heavy chains) and construction of a 
single chain antibody (scFV)." ; 

Submitted (MAY-1999) to the EMBL/GenBank/DDBJ databases. 
EMBL; AF152372; AAD40243.1; -. 
PIR; B45837; B45837. 
PDB; 1CQK; ll-SEP-99. 
PDB; 1191; 25-DEC-02 . 
PDB; 1KCU; 11 -MAY- 02 . 
MGD; MGI : 9644 6; Igh-4. 
InterPro; IPR007110; Ig-like. 
InterPro; IPR003006; Ig_MHC. 
InterPro; IPR003596; Ig__v. 
Pfam; PF00047; ig; 3. 
SMART; SM00406; IGv; 1. 
PROSITE; PS50835; IG_LIKE; 4. 
PROSITE; PS00290; IG_MHC; 1. 
NONJTER 1 1 

NONJTER 437 4 37 

SEQUENCE 437 AA; 48142 MW; 



5C3A7BB3EE7D697C CRC64; 



Query Match 50.7%; Score 833.5; DB 11; Length 437; 

Best Local Similarity 50.8%; Pred. No. 1.3e-70; 

Matches 162; Conservative 47; Mismatches 71; Indels 39; 



Gaps 



QY 
Db 

Qy 



16 PGSTGDVTMLQMAGQCSQNEYF DSLLHACI PCQLRC 51 

III : I I II | : | | | : 

12 6 PGSAAQTNSMVTLG-CLVKGYFPEPVTVTWNSGSLSSGVHT-FPAVLQSDLYTLSSSVTV 183 



52 -SSNTPPLTCQRYCNASVTNSVKGVDKTHT — 
II I I II: I III 



-CPP — CPAPELLGGPSVFLFPPKPK 103 
II I II: I I I : t I I I II 



Db 



184 PSSTWPSETVT — CNVAHPASSTKVDKKIVPRDCGCKPCICTVPEV SSVFIFPPKPK 238 



Qy 104 DTLMISRTPEVTCVWDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRWSVLTV 163 

I I I : I I : I I I I I I I : I : I I I I : I : I : I I II I I i : I : t I I I I : I I I : I II I : 

Db 239 DVLTITLTPKVTCVWDISKDDPEVQFSWFVDDVEVHTAQTQPREEQFNSTFRSVSELPI 298 

Qy 164 LHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCL 223 

: I II I I I I I I : II : I : : I I I I I I I I I II I I : I : I I I I I : I I : : : : I : : II I I I : 
Db 299 MHQDWLNGKEFKCRVNSAAFPAPIEKTISKTKGRPKAPQVYTIPPPKEQMAKDKVSLTCM 358 

Qy 224 VKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVM 283 

: I : I M I I I : I I I I I I I I I :: I : I I I : I : I I I I I I I I : I I I : I I I : 
Db 359 ITDFFPEDITVEWQWNGQPAENYKNTQPIMDTDGSYFVYSKLNVQKSNWEAGNTFTCSVL 418 

Qy 284 HEALHNHYTQKSLSLSPGK 302 

II I I I I : I : I : I I I I I I 

Db 419 HEGLHNHHTEKNLSHSPGK 437 



RESULT 14 
Q9D8L4 

ID Q9D8L4 PRELIMINARY; PRT; 473 AA. 

AC Q9D8L4; 

DT 01-JUN-2001 (TrEMBLrel. 17, Created) 

DT 01-JUN-2001 (TrEMBLrel. 17, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE 1810060O09Rik protein. 

GN IGH-1 OR 1810060O09RIK. 

OS Mus mus cuius (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=C57BL/6J; TI SSUE=Pancreas ; 

RX MEDLINE=2 1085660; PubMed=11217851 ; 

RA Kawai J., Shinagawa A. , Shibata K., Yoshino M. , Itoh M. , Ishii Y . , 

RA Arakawa T., Hara A., Fukunishi Y., Konno H., Adachi J., Fukuda S., 

RA Aizawa K. , Izawa M. , Nishi K. , Kiyosawa H. , Kondo S., Yamanaka I., 

RA Saito T., Okazaki Y., Gojobori T., Bono H., Kasukawa T . , Saito R. , 

RA Kadota K., Matsuda H.A., Ashburner M. , Batalov S., Casavant T . , 

RA Fleischmann W., Gaasterland T., Gissi C, King B., Kochiwa H., 

RA Kuehl P., Lewis S., Matsuo Y., Nikaido I., Pesole G. , Quackenbush J., 

RA Schriml L.M., Staubli F. , Suzuki R. , Tomita M. , Wagner L., Washio T . , 

RA Sakai K., Okido T., Furuno M. , Aono H., Baldarelli R. , Barsh G., 

RA Blake J., Boffelli D . , Bojunga N . , Carninci P., de Bonaldo M.F., 

RA Brownstein M.J., Bult C, Fletcher C, Fujita M. , Gariboldi M. , 

RA Gustincich S., Hill D., Hofmann M., Hume D.A., Kamiya M. , Lee N.H., 

RA Lyons P., Marchionni L., Mashima J., Mazzarelli J., Mombaerts P., 

RA Nordone P., Ring B., Ringwald M. , Rodriguez I., Sakamoto N., 

RA Sasaki H., Sato K. , Schoenbach C, Seya T., Shibata Y., Storch K.-F., 

RA Suzuki H., Toyo-oka K., Wang K.H., Weitz C, Whittaker C, Wilming L., 

RA Wynshaw-Boris A., Yoshida K., Hasegawa Y., Kawaji H., Kohtsuki S., 

RA Hayashizaki Y. ; 

RT "Functional annotation of a full-length mouse cDNA collection."; 

RL Nature 409:685-690(2 001). 

DR EMBL; AK007918; BAB25349.1; -. 



DR PIR; S26746; S26746. 

DR HSSP; P01842; 7 FAB . 

DR MGD; MGI : 96443; Igh-1. 

DR InterPro; IPR007110; Ig-like. 

DR InterPro; IPR003006; Ig_MHC. 

DR InterPro; IPR003596; Ig_v. 

DR Pfam; PF00047; ig; 3. 

DR SMART; SM004 06; IGv; 1. 

DR PROSITE; PS50835; IG_LIKE; 4. 

DR PROSITE; PS00290; IG_MHC; 1. 

SQ SEQUENCE 473 AA; 51699 MW; 9DED57A514 475FBB CRC64; 



Query Match 50.3%; Score 826.5; DB 11; Length 473; 

Best Local Similarity 50.8%; Pred. No. 6.7e-70; 

Matches 162; Conservative 50; Mismatches 74; Indels 33; Gaps 5 

Qy 17 GSTGD VTMLQMAGQCSQNEY-FDSLLHA CIPCQLRCSSNTPP 57 

| : I I ||: : I I : I : I I : : : : I I I I 

Db 155 GTTGSSWLGCLVKGYFPEPVTLTWNSGSLSSGVHTFPALLQSGLYTLSSSVTVTSNTWP 214 



Qy 58 LTCQRYCNASVTNSVKGVD KTHTCPPCPAPELLGGPSVFLFPPKPK 103 

: | | Ml I : : I II I I I : I I I I I I I I : I I I I I 

Db 215 SQTITCNVAHPASSTKVDKKIEPRVPITQNPCPPLKECPPCAAPDLLGGPSVFIFPPKIK 274 

Qy 104 DTLMISRTPEWCVWDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRWSVLTV 163 

I I I I I : I I I I I I I I I I : I I : I : : I : I : I I I I I : I : I MM MM I : 
Db 275 D VLMI S L S PMVT CVWDVS E DD P DVQ I S W FVNNVEVHT AQT QTHRED YN S T LRWS AL P I 334 



Qy 164 LHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCL 223 

I I j I : : I I I : I I j I : I : I I I : I I I I I I M : I I MM Ml : I M I : M I I : 
Db 335 QHQDWMSGKEFKCKVNNRALPSPIEKTISKPRGPVRAPQVYVLPPPAEEMTKKEFSLTCM 394 

Qy 224 VKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVM 283 

: || I : : I I I : | MM I III I II II II M M I I I I I II I : : I : : I : I I I : 
Db 395 ITGFLPAEIAVDWTSNGRTEQNYKNTATVLDSDGSYFMYSKLRVQKSTWERGSLFACSW 454 



Qy 284 HEALHNHYTQKSLSLSPGK 3 02 

II MM I MM I II 
Db 455 HEGLHNHLTTKTISRSLGK 473 



RESULT 15 
Q99L31 

ID Q99L31 PRELIMINARY; PRT; 468 AA. 

AC Q99L31; 

DT 01-JUN-2001 (TrEMBLrel. 17, Created) 

DT 01-JUN-2001 (TrEMBLrel. 17, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Similar to RIKEN cDNA 1810060009 gene. 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBIJTaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Strausberg R. ; 

RL Submitted (FEB-2001) to the EMBL/ GenBank/DDBJ databases. 



DR EMBL; BC003878; AAH03878.1; 

DR PDB; 2AP2; 24-NOV-99. 

DR InterPro; IPR007110; Ig-like. 

DR InterPro; IPR003006; Ig_MHC. 

DR InterPro; IPR003596; Ig_v. 

DR Pfam; PF00047; ig; 3. 

DR SMART; SM00406; IGv; 1. 

DR PROSITE; PS50835; IG_LIKE; 4. 

DR PROSITE; PS00290; IGJMHC; 1. 

SQ SEQUENCE 468 AA; 51661 MW; 96352328B3332ADB CRC64; 



Query Match 49.6%; Score 815.5; DB 11; Length 468; 

Best Local Similarity 54.8%; Pred. No. 7.3e-69; 

Matches 161; Conservative 43; Mismatches 77; Indels 13; Gaps 5; 

Qy 22 VTMLQMAGQCSQNEY-FDSLLHA CIPCQLRCSSNTPP LTCQRYCNAS VTNSVKG 74 

||: : | | : | : : | : : : : I : I I : II III I 

Db 175 VTLTWNSGSLSSGVHTFPAVLQSDLYTLSSSVTVTSSTWPSQSITCNVAHPASSTKVDKK 234 

Qy 75 VD KTHTCPP — CPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCWVDVSHEDPEV 128 

: : III I I I I I I I I I I I I : I II I I I I I I I : I I I I I I I I I I : I I : I 

Db 235 IEPRGPTIKPCPPCKCPAPNLLGGPSVFIFPPKIKDVLMISLSPMVTCVWDVSEDDPDV 294 

Qy 129 KFNWYVDGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPIE 188 

: : | : I : III I : I : I I I I I I I I I 1 : I I I I :: I I I : II M : I I I I I I I I I 
Db 2 95 QISWFWNVEVLTAQTQTHREDYNSTLRWSALPIQHQDWMSGKEFKCKVNNKALPAPIE 354 

Qy 189 KTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKT 248 

: I I I I I I I I I I I I I I : I : I I I I : I I I : I I I I I I I I : I I : I I II 
Db 355 RTISKPKGSVRAPQVYVLPPPEEEMTKKQVTLTCMVTDFMPEDI YVEWTNNGKTELNYKN 414 

Qy 249 TPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 302 

I I I I I I I I I : I : I I I I I : I I : I : I I I I : I I I I I I : I II I : I I I 
Db 415 TEPVLDSDGSYFMYSKLRVEKKNWVERNSYSCSWHEGLHNHHTTKSFSRTPGK 468 



Search completed: April 19, 2004, 13:22:56 
Job time : 45.7407 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 
Run on: 



Title: 

Perfect score: 
Sequence : 

Scoring table: 



April 19, 2004, 13:17:02 ; Search time 12.428 Seconds 

(without alignments) 
1265.305 Million cell updates/sec 

PCT-US03-05147-3 
1643 

1 METDTLLLWVLLLWVPGSTG MHEALHNHYTQKSLSLSPGK 302 

BLOSUM62 

Gapop 10.0 , Gapext 0.5 



Searched: 141681 seqs, 52070155 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



141681 



Database 



SwissProt 42:* 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



Result 



Query 



No. 


Score 


Match 


Length 


DB 


ID 


Description 


1 


1243 


75. 


7 


330 


1 


GC1_HUMAN 


P01857 


homo sapien 


2 


1153 


70. 


2 


326 


1 


GC2_HUMAN 


P01859 


homo sapien 


3 


1143.5 


69. 


6 


290 


1 


GC3_HUMAN 


P01860 


homo sapien 


4 


1142 


69. 


5 


327 


1 


GC4_HUMAN 


P01861 


homo sapien 


5 


934 . 5 


56. 


9 


323 


1 


GC_RABIT 


P01870 


oryctolagus 


6 


903.5 


55. 


0 


329 


1 


GC2_CAVPO 


P01862 


cavia porce 


7 


856 


52. 


1 


329 


1 


GC3_MOUSE 


P22436 


mus musculu 


8 


849 


51. 


7 


333 


1 


GCB_RAT 


P20761 


rattus norv 


9 


845 


51. 


4 


398 


1 


GC3M_MOUSE 


P03987 


mus musculu 


10 


827 


50. 


3 


326 


1 


GC 1_RAT 


P20759 


rattus norv 


11 


826 


50. 


3 


329 


1 


GCC RAT 


P20762 


rattus norv 


12 


825.5 


50. 


2 


324 


1 


GC1_M0USE 


P01868 


mus musculu 


13 


820.5 


49. 


9 


393 


1 


GC1M_M0USE 


P01869 


mus musculu 


14 


819.5 


49. 


9 


330 


1 


GCAAJ40USE 


P01863 


mus musculu 


15 


818.5 


49. 


8 


335 


1 


GCAB_MOUSE 


P01864 


mus musculu 


16 


814.5 


49. 


6 


399 


1 


GCAM_MOUSE 


P01865 


mus musculu 


17 


796 


48. 


4 


322 


1 


GCA RAT 


P20760 


rattus norv 



18 


789.5 


48 


.1 


336 


1 


GCBJVtOUSE 


P01866 


mus musculu 


19 


784.5 


47 


.7 


405 


1 


GCBMJ40USE 


P01867 


mus musculu 


20 


362.5 


22 


. 1 


421 


1 


EPC_MOUSE 


P06336 


mus musculu 


21 


358 


21 


. 8 


454 


1 


MUC_HUMAN 


P01871 


homo sapien 


22 


357 


21 


.7 


455 


1 


MUCJVtOUSE 


P01872 


mus musculu 


23 


356 


21 


.7 


391 


1 


MUCB_HUMAN 


P04220 


homo sapien 


24 


353 


21 


.5 


429 


1 


EPC_RAT 


P01855 


rattus norv 


25 


352 


21 


.4 


428 


1 


EPC_HUMAN 


P01854 


homo sapien 


26 


349 


21 


2 


458 


1 


MUC_RABIT 


P03988 


oryctolagus 


27 


347 


21 


1 


476 


1 


MUCM__MOUSE 


P01873 


mus musculu 


28 


339 


20 


6 


479 


1 


MUCM_RABIT 


P04221 


oryctolagus 


29 


332 


20 


2 


450 


1 


MUC^CANFA 


P01874 


canis famil 


30 


332 


20 


2 


454 


1 


MUC_MESAU 


P06337 


mesocricetu 


31 


330. 5 


20 


1 


457 


1 


MUC_SUNMU 


P20768 


suncus muri 


32 


310 


18 


9 


438 


1 


HVC2__HETFR 


P23085 


heterodontu 


33 


295. 5 


18 


0 


299 


1 


ALC_RABIT 


P01879 


oryctolagus 


34 


294 


17 


9 


438 


1 


HVCS_HETFR 


P23087 


heterodontu 


35 


287.5 


17 


5 


446 


1 


MUC_CHICK 


P01875 


gallus gall 


36 


287 


17 


5 


370 


1 


HVC1 HETFR 


P23084 


heterodontu 


37 


286.5 


17 


4 


184 


1 


TR17_HUMAN 


Q02223 


homo sapien 


38 


285 


17 


3 


461 


1 


HVCM HETFR 


P23088 


heterodontu 


39 


284.5 


17 


3 


353 


1 


ALCl_GORGO 


P20758 


gorilla gor 


40 


283 


17 


2 


393 


1 


HVC3_HETFR 


P23086 


heterodontu 


41 


280 


17. 


0 


353 


1 


ALCl HUMAN 


P01876 


homo sapien 


42 


271.5 


16. 


5 


340 


1 


ALC2__HUMAN 


P01877 


homo sapien 


43 


262.5 


16. 


0 


344 


1 


ALC_MOUSE 


P01878 


mus musculu 


44 


251.5 


15. 


3 


481 


1 


MUCM_ICTPU 


P23735 


ictalurus p 


45 


184.5 


11. 


2 


513 


1 


SHS1_M0USE 


P97797 


m protein-t 



ALIGNMENTS 



RESULT 1 




GC1_ 


JHUMAN 




ID 


GC1 HUMAN STANDARD; PRT; 330 AA. 




AC 


P01857; 




DT 


21-JUL-1986 (Rel. 01, Created) 




DT 


21-JUL-1986 (Rel. 01, Last sequence update) 




DT 


15-MAR-2004 (Rel. 43, Last annotation update) 




DE 


Ig gamma- 1 chain C region . 




GN 


IGHGl. 




OS 


Homo sapiens (Human) . 




OC 


Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; 


Euteleostomi ; 


OC 


Mammalia; Eutheria; Primates; Catarrhini; Hominidae; 


Homo . 


OX 


NCBI TaxID=9606; 




RN 


[1] 




RP 


SEQUENCE FROM N.A. 




RX 


MEDLINE=8227 423 8; PubMed-62 8 74 32 ; 




RA 


Ellison J.W., Berson B.J., Hood L.E.; 




RT 


"The nucleotide sequence of a human immunoglobulin C 


gamma 1 gene. 


RL 


Nucleic Acids Res. 10:4071-4 07 9(1982). 




RN 


[2] 




RP 


SEQUENCE OF 1-135 (MYELOMA PROTEIN EU) . 




RX 


MEDLINE=71064024; PubMed=54 8 97 7 1 ; 




RA 


Cunningham B.A., Rutishauser U., Gall W.E., Gottlieb 


P.D., 


RA 


Waxdal M.J., Edelman G.M. ; 





RT "The covalent structure of a human gamma G- immunoglobulin . VII. Amino 

RT acid sequence of heavy-chain cyanogen bromide fragments H1-H4 . " ; 

RL Biochemistry 9:3161-317 0(197 0). 

RN [3] 

RP SEQUENCE OF 136-329 (EU) . 

RX MEDLINE=71064025; PubMed=553 0 8 42 ; 

RA Rutishauser U., Cunningham B.A., Bennett C, Konigsberg W.H., 

RA Edelman G.M. ; 

RT "The covalent structure of a human gamma G- immunoglobulin . 8. Amino 

RT acid sequence of heavy-chain cyanogen bromide fragments H5-H7."; 

RL Biochemistry 9:3171-3181(1970). 

RN [4] 

RP SEQUENCE (MYELOMA PROTEIN NIE) . 

RX MEDLINE=7 7 0702 69; PubMed=8 2 6475; 

RA Ponstingl H., Hilschmann N.; 

RT "The rule of antibody structure. The primary structure of a 

RT monoclonal IgGl immunoglobulin (myeloma protein Nie) . III. The 

RT chymotryptic peptides of the H-chain, alignment of the tryptic 

RT peptides and discussion of the complete structure."; 

RL Hoppe-Seyler's Z. Physiol. Chem. 357:1571-1604(1976). 

RN [5] 

RP SEQUENCE (MYELOMA PROTEIN KOL) , AND DISULFIDE BONDS. 

RX MEDLINE=83289131; PubMed=68 84 9 94 ; 

RA Schmidt W.E., Jung H.-D., Palm W., Hilschmann N. ; 

RT "Three-dimensional structure determination of antibodies. Primary 

RT structure of crystallized monoclonal immunoglobulin IgGl KOL, I."; 

RL Hoppe-Seyler's Z. Physiol. Chem. 364:713-747(1983). 

RN [6] 

RP DISULFIDE BONDS. 

RX MEDLINE=710 64 027; PubMed=4 923144 ; 

RA Gall W.E., Edelman G.M. ; 

RT "The covalent structure of a human gamma G-immunoglobulin . X. 

RT Intrachain disulfide bonds."; 

RL Biochemistry 9:3188-3196(1970). 

RN [7] 

RP DISULFIDE BONDS. 

RX MEDLINE=77070267; PubMed=10 02129 ; 

RA Dreker L., Schwarz J., Reichel W. , Hilschmann N. ; 

RT "Rule of antibody structure. The primary structure of a monoclonal 

RT IgGl immunoglobulin (myeloma protein Nie), I: Purification and 

RT characterization of the protein, the L- and H-chains, the 

RT cyanogen bromide cleavage products, and the disulfide bridges."; 

RL Hoppe-Seyler's Z. Physiol. Chem. 357:1515-154 0(197 6). 

RN [8] 

RP X-RAY CRYSTALLOGRAPHY (2.9 ANGSTROMS). 

RX MEDLINE=81208100; PubMed-7236608 ; 

RA Deisenhofer J. ; 

RT "Crystallographic refinement and atomic models of a human Fc fragment 

RT and its complex with fragment B of protein A from Staphylococcus 

RT aureus at 2.9- and 2 . 8-A resolution."; 

RL Biochemistry 2 0:2361-2370(1981). 

CC -!- MISCELLANEOUS: Nie has the G1M(17) allotypic marker, 97-K, and the 
CC G1M(1) markers, 239-D and 241-L. KOL and EU sequences have the 

CC G1M(3) marker and the G1M (non-1) markers. 

CC -!- MISCELLANEOUS: Nie also differs in the amidation states of 
CC 35, 116, 198, 269 and 272. 

CC -!- MISCELLANEOUS: EU also differs in the amidation states of residues 



cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 

DR 



155, 166, 177, 195, 198, 269, and 272 and in the order of residues 
268-272 . 

-!- MISCELLANEOUS: KOL also differs in the amidation states of 
residues 198, 267 and 272. 

This SWISS-PROT entry is copyright. It is produced through a collaboration 
between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 
the European Bioinf ormatics Institute. There are no restrictions on its 
use by non-profit institutions as long as its content is in no way 
modified and this statement is not removed. Usage by and for commercial 
entities requires a license agreement (See http://www.isb-sib.ch/announce/ 
or send an email to license@isb-sib . ch) . 

EMBL; J00228; AAC82527.1; ALT INIT. 



DR 


PIR; 


A93433; 


GHHU . 




DR 


PDB; 


1FC1; 


15 


-JUL- 


92 


DR 


PDB; 


1FC2; 


15 


-JUL- 


92 


DR 


PDB; 


1AJ7 ; 


12 


-NOV- 


97 


DR 


PDB; 


1D5B; 


09 


-FEB- 


00 


DR 


PDB; 


1D5I; 


09 


-FEB- 


00 


DR 


PDB; 


1D6V; 


04 


-OCT- 


00 


DR 


PDB; 


1DN2; 


17 


-MAY- 


00 


DR 


PDB; 


1E4K; 


06 


-JUN- 


01 


DR 


PDB; 


1FCC; 


20 


-JUL- 


95 


DR 


PDB; 


1HZH; 


12 


-JUN- 


02 


DR 


PDB; 


1I7Z; 


08 


-AUG- 


01 


DR 


PDB; 


HIS; 


16 


-MAY- 


01 


DR 


PDB; 


1IIX; 


16 


-MAY- 


01 


DR 


PDB; 


1L6X; 


10 


-APR- 


02 


DR 


PDB; 


2RCS; 


12 


-NOV- 


97 



DR Genew; HGNC:552 5; IGHG1. 

DR MIM; 147100; -. 

DR GO; GO: 0005624; C:membrane fraction; NAS . 

DR GO; GO:0003823; F: antigen binding; TAS . 

DR GO; GO: 0006955; P : immune response; NAS. 

DR InterPro; IPR007110; Ig-like. 

DR InterPro; IPR003597; Ig_cl. 

DR InterPro; IPR003006; IgJVTHC. 

DR Pfam; PF00047; ig; 3. 

DR SMART; SM00407; IGcl; 2. 

DR PROSITE; PS50835; IG_LIKE; 3. 

DR PROSITE; PS00290; IG_MHC; 2. 

KW Immunoglobulin domain; Immunoglobulin C region; Glycoprotein; 

KW 3D-structure . 



FT 


NON_TER 


1 


1 






FT 


DOMAIN 


1 


98 


CHI . 




FT 


DOMAIN 


99 


110 


HINGE. 




FT 


DOMAIN 


111 


223 


CH2 . 




FT 


DOMAIN 


224 


330 


CH3. 




FT 


DISULFID 


27 


83 






FT 


DISULFID 


103 


103 


INTERCHAIN 


(WITH LIGHT CHAIN) 


FT 


DISULFID 


109 


109 


INTERCHAIN 


(WITH HEAVY CHAIN) 


FT 


DISULFID 


112 


112 


INTERCHAIN 


(WITH HEAVY CHAIN) 


FT 


DISULFID 


144 


204 






FT 


DISULFID 


250 


308 






FT 


CARBOHYD 


180 


180 


N-LINKED ( 


GLCNAC . . . ) . 


FT 


MOD RES 


330 
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281 


FT 
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287 
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FT 
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FT 


TURN 
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FT 
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FT 


TURN 


313 
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FT 


TURN 


316 
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FT 


STRAND 


320 
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SQ 


SEQUENCE 


330 AA; 


36: 



K -> R (IN G1M(3) MARKER). 

/FTId=VAR_003886. 

D -> E (IN G1M (NON-1 ) MARKER) 

/FTId=VAR_0038 87. 

L -> M (IN GlM(NON-l) MARKER) 

/ FTId=VAR 003888. 



.06 MW; 3770EE106C2FA33D CRC64; 



Query Match 75. 7%; 

Best Local Similarity 92.1%; 
Matches 234; Conservative 



Score 1243; DB 1; 
Pred. No. 4.9e-91; 
1; Mismatches 11; 



Length 330; 



Indels 



Gaps 



2; 



Qy 

Db 



51 CSSNTPPLTCQRYCNASVTNSV — KGVDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMI 108 

I : I I I I I I I II I I I I I I I I I I I I II I I I I II I I I I I I I I I I 

83 CNVNHKP SNTKVDKKVEPKSCDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMI 136 



Qy 

Db 

Qy 

Db 



109 SRTPEVTCWVDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDW 168 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I II I I I 
137 SRTPEWCVWDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDW 196 



169 



228 



LNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFY 

I I I I I i I I I I I II I I II I I I I I I I I I I I II I I I I I I I I I I I I I II I II I I I I I I I I I I I I 
197 LNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFY 256 



Qy 



22 9 PSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALH 2 88 



Db 257 PSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALH 316 

Qy 289 NHYTQKSLSLSPGK 302 

I I I I I I I I I I I I I I 
Db 317 NHYTQKSLSLSPGK 330 



RESULT 2 
GC2_HUMAN 

ID GC2_HUMAN STANDARD; PRT; 32 6 AA. 

AC P01859; 

DT 21-JUL-1986 (Rel. 01, Created) 

DT 21-JUL-1986 (Rel. 01, Last sequence update) 

DT 16-OCT-2001 (Rel. 40, Last annotation update) 

DE Ig gamma-2 chain C region. 

GN IGHG2. 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE OF 2-326 FROM N.A. 

RX MEDLINE=82197621; PubMed=6804948 ; 

RA Ellison J.W. , Hood L.E.; 

RT "Linkage and sequence homology of two human immunoglobulin gamma 

RT heavy chain constant region genes. 1 '; 

RL Proc. Natl. Acad. Sci. U.S.A. 79:1984-198 8(1982). 

RN [2] 

RP SEQUENCE OF 88-115 FROM N.A. 

RC TISSUE=Fetal liver; 

RX MEDLINE=8300194 3; PubMed=68 11139 ; 

RA Takahashi N., Ueda S., Obata M. , Nikaido T., Nakai S., Honjo T . ; 

RT "Structure of human immunoglobulin gamma genes: implications for 

RT evolution of a gene family."; 

RL Cell 29:671-679(1982). 

RN [3] 

RP SEQUENCE OF 99-177 AND 310-326 FROM N.A. 

RC TISSUE=Fetal liver; 

RX MEDLINE=84235992; PubMed-632 9 67 6 ; 

RA Krawinkel U., Rabbi tts T.H.; 

RT "Comparison of the hinge-coding segments in human immunoglobulin gamma 

RT heavy chain genes and the linkage of the gamma 2 and gamma 4 subclass 

RT genes."; 

RL EMBO J. 1:403-407(1982). 

RN [4] 

RP SEQUENCE OF 1-325 (MYELOMA PROTEIN TIL) . 

RX MEDLINE=81007873; PubMed=6774012 ; 

RA Wang A.-C, Tung E., Fudenberg H.H.; 

RT "The primary structure of a human IgG2 heavy chain: genetic, 

RT evolutionary, and functional implications."; 

RL J. Immunol. 125:104 8-1054(1980). 

RN [5] 

RP SEQUENCE OF 1-85 AND 132-325 (MYELOMA PROTEIN ZIE) . 

RX MEDLINE=8 000 1357; PubMed=113 060 ; 

RA Connell G.E., Parr D.M., Hofmann T.; 

RT "The amino acid sequences of the three heavy chain constant region 



RT domains of a human IgG2 myeloma protein."; 

RL Can. J. Biochem. 57:758-767(1979). 

RN [6] 

RP SEQUENCE OF 238-275 (ZIE) . 

RX MEDLINE-80114419; PubMed=ll 8 92 0 ; 

RA Hofmann T. , Parr D.M.; 

RT "A note of the amino acid sequence of residues 381-391 of human 

RT immunoglobulins gamma chains."; 

RL Mol. Immunol. 16:923-925(197 9). 

RN [7] 

RP REVISIONS TO 25; 59; 60 AND 264-268 (ZIE) . 

RA Hofmann T., Parr D.M. ; 

RL Submitted (MAR-1980) to the PIR data bank. 

RN [8] 

RP SEQUENCE OF 1-121 (DOT) . 

RX MEDLINE=95255298; PubMed=7737190 ; 

RA Stoppini M. , Bellotti V., Negri A., Merlini G. , Garver F. , Ferri G. ; 

RT "Characterization of the two unique human anti-flavin monoclonal 

RT immunoglobulins."; 

RL Eur. J. Biochem. 228:886-893(1995). 

RN [9] 

RP DISULFIDE BONDS. 

RX MEDLINE=72033500; PubMed=4 94 04 72 ; 

RA Milstein C, Frangione B.; 

RT "Disulphide bridges of the heavy chain of human immunoglobulin G2 . " ; 

RL Biochem. J. 121:217-225(1971). 

RN [10] 

RP DISULFIDE BONDS. 

RX MEDLINE=69064124; PubMed=57 827 07 ; 

RA Frangione B., Milstein C, Pink J.R.L.; 

RT "Structural studies of immunoglobulin G."; 

RL Nature 221:145-148(1969). 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; J00230; AAB59393.1; -. 

DR PIR; A93906; G2HU. 

DR HSSP; P01857; 1FC1. 

DR Genew; HGNC:552 6; IGHG2 . 

DR MIM; 147110; 

DR GO; GO:0005624; C:membrane fraction; NAS . 

DR GO; GO:0003823; F:antigen binding; TAS . 

DR GO; GO:0006955; P:immune response; NAS. 

DR InterPro; IPR007110; Ig-like. 

DR InterPro; IPR003597; Ig_cl. 

DR InterPro; IPR003006; Ig_MHC. 

DR Pfam; PF00047; ig; 3. 

DR SMART; SM00407; IGcl; 2. 

DR PROSITE; PS50835; IG_LIKE; 3. 

DR PROSITE; PS00290; IG_MHC; 2. 

KW Immunoglobulin domain; Immunoglobulin C region. 
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FT 


MOD_RES 
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REMOVED POST-TRANSLATIONALLY (PROBABLE) 


FT 
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60 
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S -> A (IN 


MYELOMA 


PROTEINS TIL & 


ZIE) . 


FT 










/ FT I d=VAR 003889. 






FT 


CONFLICT 


109 


109 




C -> S (IN 


REF. 3) . 






SQ 


SEQUENCE 


32 6 AA; 


358£ 


3 4 MW; 


8310878C6878CF9C 


CRC64; 




Query Match 




70 


2%; 


Score 1153; 


DB 1; 


Length 32 6; 




Best Local Similarity 


87 


6%; 


Pred. No. 6 


,2e-84; 







Matches 218; Conservative 



8; Mismatches 



15; Indels 



Gaps 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



61 QRY-CNASVTNSVKGVDKT HTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPE 113 

I I I I I I I I I I I I I I I I : I II I II I I II I I II I I I I I I I I 

7 9 QTYTCNVDHKPSNTKVDKTVERKCCVECPPCPAPP-VAGPSVFLFPPKPKDTLMISRTPE 137 

114 VTCVWDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKE 173 

I I I I I 1 I I I I I I I I I : I I I I I I I I I I I I I I II I I I II : I I I : I I I I I I I I : I I I I I I I I I 

138 VTCVWDVSHEDPEVQFNWYVDGVEVHNAKTKPREEQFNSTFRWSVLTVVHQDWLNGKE 197 

174 YKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIA 233 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I : I I I I II I I I I II I I I I I I I I 

19 8 YKCKVSNKGLPAPIEKTISKTKGQPREPQVYTLPPSREEMTKNQVSLTCLVKGFYPSDIA 257 



234 



293 



VEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQ 

I I I I I I I I I I I I I I I I II : I I I I I I I I I I I I I I I I II I I I II I I I I I I I I I I II I I I I I I 
258 VEWESNGQPENNYKTTPPMLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQ 317 



294 



302 



KSLSLSPGK 

I I I I I I I I I 
318 KSLSLSPGK 326 



RESULT 3 
GC3JHUMAN 

ID GC3_HUMAN STANDARD; PRT; 290 AA. 

AC P01860; 

DT 21-JUL-1986 (Rel. 01, Created) 

DT 21-JUL-1986 (Rel. 01, Last sequence update) 

DT 15-MAR-2004 (Rel. 43, Last annotation update) 

DE Ig gamma-3 chain C region (Heavy chain disease protein) (HDC) . 
GN IGHG3. 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 



OX NCBIJTaxI D=9 60 6 ; 

RN [1] 

RP SEQUENCE (DISEASE PROTEIN WIS) . 

RX MEDLINE=81021548; PubMed=67747 47 ; 

RA Frangione B., Rosenwasser E., Prelli F., Franklin E.C.; 

RT "Primary structure of human gamma 3 immunoglobulin deletion mutant: 

RT gamma 3 heavy-chain disease protein Wis."; 

RL Biochemistry 19:4304-4308(1980). 

RN [2] 

RP REVISIONS TO 12-97 (PROTEIN WIS) . 

RX MEDLINE=77118561; PubMed=4 02363 ; 

RA Michaelsen T.E., Frangione B., Franklin E.C.; 

RT "Primary structure of the T hinge 1 region of human IgG3. Probable 

RT quadruplication of a 15-amino acid residue basic unit." ; 

RL J. Biol. Chem. 252:883-889(1977). 

RN [3] 

RP REVISIONS TO 59-289 (PROTEIN WIS/DISEASE PROTEIN ZUC) . 

RX MEDLINE=77021516; PubMed=82 394 5 ; 

RA Wolfenstein-Todel C, Frangione B., Prelli F., Franklin E.C.; 

RT "The amino acid sequence of 'heavy chain disease 1 protein ZUC. 

RT Structure of the Fc fragment of immunoglobulin G3 . " ; 

RL Biochem. Biophys . Res. Commun. 71:907-914(1976). 

RN [4] 

RP SEQUENCE FROM N . A. (DISEASE PROTEIN OMM) . 

RX MEDLINE=82247835; PubMed-68 085 05 ; 

RA Alexander A., Steinmetz M. , Barritault D., Frangione B., 

RA Franklin E.C., Hood L., Buxbaum J.N.; 

RT "Gamma Heavy chain disease in man: cDNA sequence supports partial 

RT gene deletion model."; 

RL Proc. Natl. Acad. Sci. U.S.A. 7 9:32 60-3264(1982). 

CC -!- SUBUNIT: Dimer linked by 12 disulfide bonds; it has an extra 
CC interchain disulfide bond at position 7 in addition to the 11 

CC normally present in the hinge region. 

CC -!- MISCELLANEOUS: The heavy chain disease protein WIS is shown. 

CC -!- MISCELLANEOUS: The sequence of residues 42-7 6 was taken from the 

CC Ref.2. 

CC -!- MISCELLANEOUS: Disease protein WIS is lacking most of the V region 
CC and all of the CHI region. 

CC -!- MISCELLANEOUS: Disease protein ZUC lack most of the V region, all 
CC of the CHI region, and part of the hinge compared with normal 

CC gamma-3 heavy chains. 

CC MISCELLANEOUS: Disease protein OMM may represent an allelic form 

CC or another gamma chain subclass. 

CC -!- MISCELLANEOUS: The hinge region in gamma-3 chains is about four 
CC times as long as in other gamma chains and contains three 

CC identical 15-residue segments preceded by a similar 17-residue 

CC segment (12-28) . 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; J00231; AAA52805.1; ALT_SEQ. 



DR HSSP; P01857; 1FC1. 

DR Genew; HGNC:552 7; IGHG3. 

DR MIM; 147120; -. 

DR GO; GO: 0005624; C : membrane fraction; NAS. 

DR GO; GO: 0003823; F: antigen binding; TAS . 

DR GO; GO: 0006955; P : immune response; NAS. 

DR InterPro; IPR007110; Ig-like. 

DR InterPro; IPR003597; Ig_cl. 

DR InterPro; IPR003006; Ig_MHC. 

DR Pfam; PF00047; ig; 2. 

DR SMART; SM00407; IGcl; 1. 

DR PROSITE; PS50835; IG_LIKE; 2. 

DR PROSITE; PS00290; IG__MHC; 1. 

KW Immunoglobulin domain; Immunoglobulin C region; Glycoprotein; Repeat; 
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/FTId-VAR_003895. 
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F -> Y (IN OMM) . 









FT /FTId=VAR_003896. 

SQ SEQUENCE 290 AA; 32331 MW; E69CBC957 05B2 F4 6 CRC64; 



Query Match 69.6%; Score 1143.5; DB 1; Length 290; 

Best Local Similarity 75.6%; Pred. No. 3e-83; 

Matches 220; Conservative 19; Mismatches 37; Indels 15; Gaps 



4; 



Qy 26 QMAG-QCSQNEYFDSLL HACI PCQLRCS SNTPPLTCQRYCNASVTNS 71 

I I I I : : : I III I : I II I I : : 

Db 1 QMQGVNCTVSSELKTPLGDTTHTCPRCPEPKSCDTPP-PCPRCPEPKSCDTPPPCPRCPE 59 

Qy 72 VKGVDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVWDVSHEDPEVKFN 131 

I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I : I 

Db 60 PKSCDTPPPCPRCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVWDVSHEDPEVQFK 119 

Qy 132 WYVDGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWLNGKEYKCKVSNKALPAPIEKTI 191 

I I II I I : II I I I I I I I I : I : I I I : I I I I I I I I I I I : I I : I I I I I I M I I I I I I I I I I I II 

Db 12 0 WYVDGVQVHNAKTKPREQQFNSTFRWSVLTVLHQNWLDGKEYKCKVSNKALPAPIEKTI 17 9 

Qy 192 SKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKTTPP 251 

II I I I I I I I I I I I I I I I I : I : I I I I II I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I 

Db 18 0 SKTKGQPREPQVYTLPPSREEMTKNQVSLTCLVKGFYPSDIAVEWESSGQPENNYNTTPP 239 

Qy 252 VLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 302 

: I I I I I I I I I I I II I I I I I I I I M I : I I I I I I I I I I I I : I I I I I I I I I I I 
Db 24 0 MLDSDGSFFLYSKLTVDKSRWQQGNIFSCSVMHEALHNRFTQKSLSLSPGK 2 90 



RESULT 4 
GC4_HUMAN 

ID GC4_HUMAN STANDARD; PRT; 327 AA. 

AC P01861; 

DT 21-JUL-1986 (Rel. 01, Created) 

DT 21-JUL-1986 (Rel. 01, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Ig gamma-4 chain C region. 

GN IGHG4. 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=960 6; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE— 83 157104; PubMed=62 99662; 

RA Ellison J.W., Buxbaum J.N,, Hood L.E.; 

RT "Nucleotide sequence of a human immunoglobulin C gamma 4 gene."; 

RL DNA 1:11-18 (1981) . 

RN [2] 

RP SEQUENCE OF 1-30 AND 81-326. 

RX MEDLINE=7 02 075 60; PubMed=4 1 92 699 ; 

RA Pink J.R.L., Buttery S.H., de Vries G.M. , Milstein C; 

RT "Human immunoglobulin subclasses. Partial amino acid sequence of the 

RT constant region of a gamma 4 chain."; 

RL Biochem. J. 117:33-47(197 0). 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; K01316; AAB59394.1; ALT INIT. 



DR 


PIR; A90933; G4HU. 






DR 


PDB; 1ADQ; 


16-SEP-98 . 






DR 


Genew; HGNC:5528; IGHG4. 






DR 


MIM; 147130; -. 






DR 


GO; GO: 0005624; C:membrane fraction; NAS 






DR 


GO; GO:0003823; F: antigen binding; TAS . 






DR 


GO; GO: 0006955; P : immune response; NAS. 






DR 


InterPro; 


IPR007110; Ig-like. 






DR 


InterPro; 


IPR003597; Ig_cl. 






DR 


InterPro ; 


IPR003006; Ig MHC. 






DR 


Pfam; PF00047; ig; 3. 






DR 


SMART; SM00407; IGcl; 2. 






DR 


PROSITE; PS50835; IG_LIKE; 3. 






DR 


PROSITE; PS00290; IG_MHC; 2. 






KW 


Immunoglobulin domain; Immunoglobulin C 


region; 


3D-s tructure 


FT 


NON_TER 


1 1 






FT 


DOMAIN 


1 98 CHI. 






FT 


DOMAIN 


99 110 HINGE. 






FT 


DOMAIN 


Ill 220 CH2. 






FT 


DOMAIN 


221 327 CH3. 






FT 


DISULFID 


14 14 INTERCHAIN 


(WITH A 


LIGHT CHAIN) 




DISULFID 


27 83 






FT 


DISULFID 


106 106 INTERCHAIN 


(WITH A 


HEAVY CHAIN) 


FT 


DISULFID 


109 109 INTERCHAIN 


(WITH A 


HEAVY CHAIN) 


FT 


DISULFID 


141 201 






FT 


DISULFID 


247 305 






SQ 


SEQUENCE 


327 AA; 35940 MW; 3EDBD811EF208E7A 


CRC64; 


Query Match 


69.5%; Score 1142; 


DB 1; 


Length 327; 


Best Local Similarity 85.4%; Pred. No. 4 


. 6e-83; 




Matches 216; 


Conservative 10; Mismatches 25; 


Indels 



2; Gaps 



QY 
Db 

QY 
Db 

QY 
Db 

QY 
Db 

QY 
Db 



52 SSNTPPLTCQRYCNASVTNSVKGVDKTH — TCPPCPAPELLGGPSVFLFPPKPKDTLMIS 109 

I I II II II: : I I I II I I I I I I I II II II I I I I I I I I I 

75 SLGTKTYTCNVDHKPSNTKVDKRVESKYGPPCPSCPAPEFLGGPSVFLFPPKPKDTLMIS 134 

110 RTPEVTCWVDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRWSVLTVLHQDWL 169 

I I I I I I I I I I I I I II I I I : I I I II I II I I I I I I I I I 1 I I I : I I 1 1 I I I I I I I I I I I II I 

135 RTPEVTCVWDVSQEDPEVQFNWYVDGVEVHNAKTKPREEQFNSTYRWSVLTVLHQDWL 194 

170 NGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYP 229 

I I I I I I I I I I I I II: I I I I I I I I I I I I I I I I I I II I II : : I : I I I I I I I I I I I I I I I I 

195 NGKEYKCKVSNKGLPSSIEKTISKAKGQPREPQVYTLPPSQEEMTKNQVSLTCLVKGFYP 254 



230 



289 



SDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHN 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I II I I I I I I : I I I I I I I I I I I II I I 
2 55 SDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSRLTVDKSRWQEGNVFSCSVMHEALHN 314 



302 



290 HYTQKSLSLSPGK 

I I I I I I I I I I II 
315 HYTQKSLSLSLGK 327 



RESULT 5 
GC_RABIT 

ID GC_RABIT STANDARD; PRT; 323 AA. 

AC P01870; 



DT 21-JUL-1986 (Rel. 01, Created) 

DT 21-JUL-1986 (Rel. 01, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Ig gamma chain C region. 

OS Oryctolagus cuniculus (Rabbit) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Lagomorpha; Leporidae; Oryctolagus. 

OX NCBI_TaxID=9986; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=84030930; PubMed=63 1352 0 ; 

RA Bernstein K.E., Alexander C.B., Mage R.G.; 

RT "Nucleotide sequence of a rabbit IgG heavy chain from the recombinant 

RT F-I haplotype. " ; 

RL Immunogenetics 18:387-397(1983). 

RN [2] 

RP SEQUENCE OF 1-128. 

RX MEDLINE=76135469; PubMed=1243651 ; 

RA Pratt D.M., Mole L.E.; 

RT "Sequence studies on the constant region of the Fd sections of rabbit 

RT immunoglobulin G of different allotype."; 

RL Biochem. J. 151:337-349(1975). 

RN [3] 

RP SEQUENCE OF 8 8-2 66 FROM N.A. 

RX MEDLINE=832 9 9917; PubMed=61 935 12 ; 

RA Martens C.L., Moore K.W. , Steinmetz M. , Hood L., Knight K.L.; 

RT "Heavy chain genes of rabbit IgG: isolation of a cDNA encoding gamma 

RT heavy chain and identification of two genomic C gamma genes."; 

RL Proc. Natl. Acad. Sci. U.S.A. 79:6018-6022(1982). 

RN [4] 

RP SEQUENCE OF 132-161. 

RX MEDLINE=7 0110015; PubMed=54 611 06 ; 

RA Fruchter R.G., Jackson S.A., Mole L.E., Porter R.R.; 

RT "Sequence studies of the Fd section of the heavy chain of rabbit 

RT immunoglobulin G."; 

RL Biochem. J. 116:249-259(1970). 

RN [5] 

RP SEQUENCE OF 129-131 AND 155-322. 

RA Hill R.L., Lebovitz H.E., Fellows R.E. Jr., Delaney R. ; 

RL (In) Killander J. (eds.); 

RL Gamma globulins, Nobel symp. 3, pp. 109-127, Almqvist and Wiksell, 

RL Stockholm (1967) . 

CC -!- MISCELLANEOUS: Ref.l sequence has the D12 allotypic marker, 

CC 104-Thr, and the E14 marker, 185-Thr. Ref.3 has the Dll and E15 

CC markers and Ref.5 the E15 marker. 

CC -!- SIMILARITY: Contains 3 immunoglobulin-like domains. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; M16426; AAA31289.1; 

DR PIR; A91749; GHRB. 



DR 


HSSP; P01 


857; 1FC1 


















DR 


InterPro ; 


IPR007110; Ig 


-like. 














DR 


InterPro; 


IPR003597; Ig 


_cl. 














DR 


InterPro; 


IPR003006; Ig 


_MHC. 














DR 


Pfam; PF00047; ig; 


3. 
















DR 


SMART; SM004 07; IGcl; 2 
















DR 


PROSITE; 


PS50835; 


IG LIKE; 3. 














DR 


PROSITE; 


PS00290; 


IGJMHC; 1. 














KW 


Immunoglobulin domain; 


Immunoglobulin C 


region; Repeat 


FT 


NONJTER 


1 


1 
















FT 


DOMAIN 


6 


96 




IG 


-LIKE 


1. 






FT 


DOMAIN 


114 


213 




IG 


-LIKE 


2 . 






FT 


DOMAIN 


222 


318 




IG 


-LIKE 


3. 






FT 


VARIANT 


104 


104 




T 


-> 


M 


(IN 


Dll 


MARKER) . 


FT 


VARIANT 


185 


185 




T 


-> 


A 


(IN 


E15 


MARKER) . 


FT 


CONFLICT 


48 


48 




N 


-> 


E 


(IN 


REF 


2) . 


FT 


CONFLICT 


71 


71 




V 


-> 


VPV (IN REF. 2) . 


FT 


CONFLICT 


144 


144 




Q 


-> 


E 


(IN 


REF 


3 AND 4) . 


FT 


CONFLICT 


173 


173 




N 


-> 


D 


(IN 


REF 


5) . 


FT 


CONFLICT 


187 


187 




Q 


-> 


E 


(IN 


REF 


3 AND 5) . 


FT 


CONFLICT 


201 


201 




N 


-> 


D 


(IN 


REF 


5) . 


FT 


CONFLICT 


218 


218 




Q 


-> 


E 


(IN 


REF 


5) . 


FT 


CONFLICT 


233 


233 




E 


-> 


Q 


(IN 


REF 


5) . 


FT 


CONFLICT 


246 


246 




N 


-> 


D 


(IN 


REF 


5) . 


FT 


CONFLICT 


256 


256 




E 


-> 


G 


(IN 


REF 


5) . 


FT 


CONFLICT 


260 


260 




N 


-> 


D 


(IN 


REF 


5) . 


FT 


CONFLICT 


266 


266 




N 


-> 


D 


(IN 


REF 


5) . 


FT 


CONFLICT 


280 


280 




Y 


-> 


W 


(IN 


REF 


5) . 


FT 


CONFLICT 


284 


284 




N 


-> 


S 


(IN 


REF 


5) . 


SQ 


SEQUENCE 


323 AA; 


35404 MW; 




69E8AA11E 


5D579A8B CRC64 ; 



Query Match 56.9%; Score 934.5; DB 1; 

Best Local Similarity 68.0%; Pred. No. l.le-66; 
Matches 174; Conservative 33; Mismatches 40; 



Length 323; 
Indels 9; 



Gaps 



3; 



QY 
Db 

QY 
Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



52 SSNTPPLTCQRYCNASVTNSVKGVDKT HTC — PPCPAPELLGGPSVFLFPPKPKDTL 106 

: I : : I : I I I : I I I I I II I II I I I I I I I I I I : I I I I I I I I I 

72 TSSSQPVTCNVAHPATNTK VDKTVAPSTCSKPTCPPPELLGGPSVFIFPPKPKDTL 127 

107 MISRTPEVTCVWDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRWSVLTVLHQ 166 

I I I I I I I I I I I I I I I I : I I I I : I II:: : I I : I I : I : I I I I I I I I : II 
128 MISRTPEVTCVWDVSQDDPEVQFTWYINNEQVRTARPPLREQQFNSTIRWSTLPITHQ 187 

167 DWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKG 22 6 

III I I I : I I I I I I I I I I I I I I I I I I I : I I I I I : I I I : I I : I ! : Mill:: | 
188 DWLRGKEFKCKVHNKALPAPIEKTISKARGQPLEPKVYTMGPPREELSSRSVSLTCMING 247 

22 7 FYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEA 28 6 

111111:1111 II: I : I I I I I I I I I I I I I : I I I : I I : I I I I : I : I I : I I I II I I 
24 8 FYPSDISVEWEKNGKAEDNYKTTPAVLDSDGSYFLYNKLSVPTSEWQRGDVFTCSVMHEA 307 

287 LHNHYTQKSLSLSPGK 302 

111111111:1 I I I I 
308 LHNHYTQKSISRSPGK 323 



RESULT 6 



GC2_CAVP0 

ID GC2_CAVP0 STANDARD; PRT; 32 9 AA. 

AC P01862; 

DT 21-JUL-1986 (Rel. 01, Created) 

DT 21-JUL-1986 (Rel. 01, Last sequence update) 

DT 15-JUL-1999 (Rel. 38, Last annotation update) 

DE Ig gamma-2 chain C region. 

OS Cavia porcellus (Guinea pig) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi ; 

OC Mammalia; Eutheria; Rodentia; Hystricognathi ; Caviidae; Cavia. 

OX NCBI_TaxID=10141; 

RN [1] 

RP SEQUENCE OF 1-3. 

RA Trischmann T.M.; 

RL Submitted (APR-1975) to the PIR data bank. 

RN [2] 

RP SEQUENCE OF 4-68. 

RX MEDLINE=7 1058 471; PubMed=553 8 60 6 ; 

RA Birshtein B.K., Hussain Q.Z., Cebra J. J.; 

RT "Structure of heavy chain from strain 13 guinea pig 

RT immunoglobulin-G (2 ) . 3. Amino acid sequence of the region around the 

RT half-cystine joining heavy and light chains."; 

RL Biochemistry 10:18-25(1971). 

RN [3] 

RP SEQUENCE OF 69-133 AND 312-329. 

RX MEDLINE=7 105848 6; PubMed-553 8 616 ; 

RA Turner K.J., Cebra J. J.; 

RT "Structure of heavy chain from strain 13 guinea pig 

RT immunoglobulin-G (2 ) . II. Amino acid sequence of the carboxyl-terminal 

RT and hinge region cyanogen bromide fragments."; 

RL Biochemistry 10:9-17(1971). 

RN [4] 

RP SEQUENCE OF 134-226. 

RX MEDLINE=75036072; PubMed=4 42 9665 ; 

RA Tracey D.E., Cebra J. J.; 

RT "Primary structure of the CH2 homology region from guinea pig IgG2 

RT antibodies . " ; 

RL Biochemistry 13:47 96-4 803(1974). 

RN [5] 

RP SEQUENCE OF 227-311. 

RX MEDLINE=75036073; PubMed=4 60 94 67 ; 

RA Trischmann T.M., Cebra J. J.; 

RT "Primary structure of the CH3 homology region from guinea pig IgG2 

RT antibodies."; 

RL Biochemistry 13:48 04-4 811(1974). 

RN [6] 

RP DISULFIDE BONDS. 

RX MEDLINE=7 1058474; PubMed=4 92254 4 ; 

RA Oliveira B., Lamm M.E.; 

RT "Interchain disulfide bridges of guinea pig gamma-2 -immunoglobul in ." ; 

RL Biochemistry 10:26-31(1971). 

CC -!- MISCELLANEOUS: This chain was isolated from pooled serum of strain 

CC 13 inbred guinea pigs. 

DR PIR; A94553; G2GP. 

DR HSSP; P01842; 7 FAB . 

DR InterPro; IPR007110; Ig-like. 

DR InterPro; IPR003597; Ig_cl. 



DR 


InterPro 


; IPR003006; Ig 


JVtHC. 






DR 


Pfam; PF00047; ig; 


2. 








DR 


SMART; SM00407; IGcl; 2 








DR 


PROSITE; 


PS50835; 


IG LIKE; 3. 






DR 


PROSITE; 


PS00290; 


IG MHC; 1. 






KW 


Immunoglobulin domain; 


Immunoglobulin C region; 


Glycoprotein 


FT 


NON TER 


1 


1 








FT 


DISULFID 


16 


16 




INTERCHAIN (WITH A 


LIGHT CHAIN) 


FT 


DISULFID 


28 


79 








FT 


DISULFID 


105 


105 




INTERCHAIN (WITH A 


HEAVY CHAIN) 


FT 


DISULFID 


107 


107 




INTERCHAIN (WITH A HEAVY CHAIN) 


FT 


DISULFID 


110 


110 




INTERCHAIN (WITH A 


HEAVY CHAIN) 


FT 


DISULFID 


142 


202 








FT 


CARBOHYD 


178 


178 




N-LINKED (GLCNAC. . 


.) ■ 


FT 


DISULFID 


248 


308 








SQ 


SEQUENCE 


32 9 AA; 


36074 MW; 


5D231B7164D1FBA9 


CRC64; 


Query Match 




55 


.0%; 


Score 903.5; DB 1; 


Length 329 


Best Local Similarity 


65 


.8%; 


Pred. No. 3.2e-64; 





Matches 171; Conservative 27; Mismatches 49; Indels 13; Gaps 2 

Qy 55 TPPLTCQRYCNASVTNSVKGVDKT HTCPPCPAPELLGGPSVFLFPPKPK 103 

II:: II: I I I I I I I I I I I I I I I I I I I : I I I I I I 

Db 70 TVPSSQKATCNVAHPASSTKVDKTVEPIRTPZPBPCTCPKCPPPENLGGPSVFIFPPKPK 129 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



104 



130 



164 



190 



DTLMISRTPEVTCVWDVSHEDPEVKFNWYVDGVEVHN7VKTKPREEQYNSTYRWSVLTV 163 
I I I I I I II II II I I I II : : I II : I I : I I I I I : I II I I I II : I : I I III : 
DTLMISLTPRVTCVWDVSQDEPEVQFTWFVDNKPVGNAETKPRVEQYNTTFRVESVLPI 189 

LHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCL 22 3 

I I I II 111:1111 I I I I I I I I I I I I I I II II I I I I I I I I I I II : I : : II : I I I 
QHQDWLRGKEFKCKVYNKALPAPIEKTISKTKGAPRMPDVYTLPPSRDELSKSKVSVTCL 249 



224 VKGFYPSDIAVEWESNGQP — ENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCS 281 

: I : I : I I I I I I I I I I I I I I : I : I I I : I I I I I I I I I I I I 1 I I : : I I 
250 IINFFPADIHVEWASNRVPVSEKEYKNTPPIEDADGSYFLYSKLTVDKSAWDQGTVYTCS 309 



282 



301 



VMHEALHNHYTQKSLSLSPG 

I I I I I I I I I I I I : : I III 
310 VMHEALHNHVTQKAISRSPG 329 



RESULT 7 
GC3_MOUSE 

ID GC3__MOUSE STANDARD; PRT; 329 AA. 

AC P22436; 

DT 01-AUG-1991 (Rel. 19, Created) 

DT 01-AUG-1991 (Rel. 19, Last sequence update) 

DT 16-OCT-2001 (Rel. 40, Last annotation update) 

DE Ig gamma-3 chain C region, secreted form. 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=85027161; PubMed=6092053 ; 



RA Wels J. A., Word C.J., Rimm D., Der-Balan G.P., Martinez H.M., 

RA Tucker P.W., Blattner F.R.; 

RT "Structural analysis of the murine IgG3 constant region gene."; 

RL EMBO J. 3:2041-2046(1984). 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; J00451; -; N0T_ANN0TATED_CDS . 

DR PIR; B02156; G3MSC. 

DR HSSP; P01857; 1FC1. 

DR InterPro; IPR007110; Ig-like. 

DR InterPro; IPR003597; Ig_cl. 

DR InterPro; IPR003006; Ig_MHC. 

DR Pfam; PF00047; ig; 3. 

DR SMART; SM00407; IGcl; 2. 

DR PROSITE; PS50835; IG_LIKE; 3. 

DR PROSITE; PS00290; IG_MHC; 1. 

KW Immunoglobulin domain; Immunoglobulin C region; Glycoprotein; 

KW Transmembrane; Alternative splicing. 



FT NONJTER 


1 


1 






FT DOMAIN 


1 


97 


CHI. 




FT DOMAIN 


98 


113 


HINGE. 




FT DOMAIN 


114 


223 


CH2 . 




FT DOMAIN 


224 


327 


CH3. 




SQ SEQUENCE 


32 9 AA; 


3622 


8 MW; F45827174182BAD6 


CRC64; 


Query Match 




52. 


1%; Score 856; DB 1; 


Length 32 9; 


Best Local Similarity 


61. 


7%; Pred. No. 1.8e-60; 




Matches 161 ; 


Conservative 


37; Mismatches 51; 


Indels 12; 



Gaps 4; 

Qy 52 SSNTPPLTCQRYCN AS VTN S VKGVD KTHTCP — PCPAPELLGGPSVFLFPPK 101 

II I I II III : I : : I II II : I I I I I I I : I I I I 

Db 71 SSTWPSQTV — ICNVAHPASKTELIKRIEPRIPKPSTPPGSSCPPGNILGGPSVFIFPPK 128 

Qy 102 PKDTLMISRTPEVTCVWDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRWSVL 161 

III I I I I I I : I I M II I I I : IE : I : I : II III I hill 1 I I I I : I I I I I 

Db 129 PKDALMISLTPKVTCVWDVSEDDPDVHVSWFVDNKEVliTAWTQPREAQYNSTFRWSAL 188 

Qy 162 TVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT 221 

: MM: I I I : M II : I I I M I II I M I II I I : : I I I M : II I : : : : I Mill 
Db 189 PIQHQDWMRGKEFKCKVNNKALPAPIERTISKPKGRAQTPQVYTIPPPREQMSKKKVSLT 248 

Qy 222 CLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCS 281 

III I : I M II I I I : I : I I I I I M I I I I : M II II I I I I I I I M M I 
Db 249 CLVTNFFSEAISVEWERNGELEQDYKNTPPILDSDGTYFLYSKLTVDTDSWLQGEIFTCS 308 

Qy 282 VMHEALHNHYTQKSLSLSPGK 302 

I M II II I I : I M : II I I I I 
Db 309 WHEALHNHHTQKNLSRSPGK 329 



RESULT 8 
GCB__RAT 

ID GCB_RAT STANDARD; PRT; 333 AA. 

AC P20761; 

DT 01-FEB-1991 (Rel. 17, Created) 

DT 01-FEB-1991 (Rel. 17, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Ig gamma-2B chain C region. 

OS Rattus norvegicus (Rat) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Rattus. 

OX NCBI_TaxID=10116; 

RN [1] 

RP SEQUENCE FROM N . A. 

RX MEDLINE=89232738; PubMed=314994 6 ; 

RA Brueggemann M. ; 

RT "Evolution of the rat immunoglobulin gamma heavy-chain gene family."; 

RL Gene 74:473-482(198 8). 

CC -!- SIMILARITY: Contains 3 immunoglobulin-like domains. 



DR 


PIR; PS0018; PS0018 














DR 


HSSP; P01 


842; 7 FAB . 














DR 


InterPro; 


IPR007110; Ig 


-like. 










DR 


InterPro ; 


IPR003597 


; ig. 


_cl. 










DR 


InterPro; 


IPR003006 


; ig 


_MHC. 










DR 


Pfam; PF00047; ig; 


3. 












DR 


SMART; SM00407; IGcl; 2 












DR 


PROSITE; 


PS50835; IG_LIKE; 3. 










DR 


PROSITE; 


PS00290; IG MHC; 1. 










KW 


Immunoglobulin domain; 


Immunoglobulin C 


region; 


Repeat 




FT 


NON_TER 


1 


l 












FT 


DOMAIN 


6 


96 




IG-LIKE 1. 








FT 


DOMAIN 


124 


223 




IG-LIKE 2. 








FT 


DOMAIN 


232 


328 




IG-LIKE 3. 








FT 


DISULFID 


15 


15 




INTERCHAIN 


(WITH A 


LIGHT 


CHAIN) 


FT 


DISULFID 


27 


80 












FT 


DISULFID 


106 


106 




INTERCHAIN 


(WITH A 


HEAVY 


CHAIN) 


FT 


DISULFID 


109 


109 




INTERCHAIN 


(WITH A 


HEAVY 


CHAIN) 


FT 


DISULFID 


112 


112 




INTERCHAIN 


(WITH A 


HEAVY 


CHAIN) 


FT 


DISULFID 


115 


115 




INTERCHAIN 


(WITH A 


HEAVY 


CHAIN) 


FT 


DISULFID 


147 


207 












FT 


DISULFID 


253 


311 












SQ 


SEQUENCE 


333 AA; 


36497 MW; 


55F8B64D4 


8D4 60A6 


CRC64 ; 




Query Match 




51 


.7%; 


Score 849; 


DB 1; 


Length 


333; 


Best Local Similarity 


52 


. 6%; 


Pred. No. 6 


• 5e-60; 







Matches 170; Conservative 42; Mismatches 73; Indels 38; Gaps 6; 

Qy 16 PGSTGDVTMLQMAGQCSQNEYF DSLLHACIPCQLR CSS 53 

II I I I : I II | : | | | : : | 

Db 13 PG-CGDTTSSTVTLGCLVKGYFPEPVTVTWNSGAXSSDVHT-FPAVLQSGLYTLTSSVTS 70 

Qy 54 NTPP LTCQRYCNASVTNSVKGVDKTH TCPPCPAPELLGGPSVFLFP 99 

: I I : I I II I II::: II I I I I I II I I I I I : I I 

Db 71 STWPSQTVTCNVAHPASSTKVDKKVERRNGGIGHKCPTCPTCHKCPVPELLGGPSVFIFP 130 

Qy 100 PKPKDTLMISRTPEVTCVWDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRWS 159 

II I I I I : I I : : I I I I I I I I I | : | : I : I : I : I : I I I I I : I : I I I I I I I II : I I I I 



Db 



131 PKPKDILLISQNAKVTCVWDVSEEEPDVQFSWFVNNVEVHTAQTQPREEQYNSTFRWS 190 



Qy 160 VLTVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQWTLPPSRDELTKNQVS 219 

I : I I I I :: 1 II : I I I I : I I I I I : I I II I I I I II I : I I I I : I : : I I : II 
Db 191 ALPIQHQDWMSGKEFKCKVNNKALPSPIEKTISKPKGLVRKPQVYVMGPPTEQLTEQTVS 250 

Qy 220 LTCLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFS 279 

I I I I I I I : I I I I I I I I I I I I I I I : I I I I I I I : I I I I I :: I I I I 
Db 251 LTCLTSGFLPNDIGVEWTSNGHIEKNYKNTEPVMDSDGSFFMYSKLNVERSRWDSRAPFV 310 

Qy 280 CSVMHEALHNHYTQKSLSLSPGK 302 

111:11 MM: M I : I Ml 

Db 311 CSWHEGLHNHHVEKSISRPPGK 333 



RESULT 9 
GC3MJVIOUSE 

ID GC3M__MOUSE STANDARD; PRT; 398 AA. 

AC P03987; 

DT 23-OCT-1986 (Rel. 02, Created) 

DT 01-AUG-1991 (Rel. 19, Last sequence update) 

DT 15-JUL-1999 (Rel. 38, Last annotation update) 

DE Ig gamma-3 chain C region, membrane -bound form. 

OS Mus mus cuius (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=85027161; PubMed=60 92 053 ; 

RA Wels J. A. , Word C.J., Rimm D., Der-Balan G.P., Martinez H.M., 

RA Tucker P.W., Blattner F.R.; 

RT "Structural analysis of the murine IgG3 constant region gene."; 

RL EMBO J. 3:2041-2046(1984). 

RN [2] 

RP SEQUENCE OF 328-398 FROM N.A. 

RX MEDLINE=84 0414 83; PubMed=631425 8 ; 

RA Komaromy M. , Clayton L., Rogers J., Robertson S., Kettman J., 

RA Wall R. ; 

RT "The structure of the mouse immunoglobulin in gamma 3 membrane gene 

RT segment."; 

RL Nucleic Acids Res. 11:6775-6785(1983). 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; J00451; AAB59655.1; -. 

DR EMBL; V01526; CAA24767.1; ALT_SEQ . 

DR PIR; A02156; G3MSM. 

DR HSSP; P01857; 1FC1. 

DR InterPro; IPR007110; Ig-like. 

DR InterPro; IPR003597; Ig_cl. 



DR InterPro; IPR003006; Ig_MHC. 

DR Pfam; PF00047; ig; 3. 

DR SMART ; SM004 07; I Gel; 2. 

DR PROSITE; PS50835; IG_LIKE; 3. 

DR PROSITE; PS00290; IG_MHC; 1. 

KW Immunoglobulin domain; Immunoglobulin 

KW Transmembrane; Alternative splicing. 



C region; Glycoprotein; 



FT 


NONJTER 


1 


1 




FT 


DOMAIN 


1 


97 


CHI. 


FT 


DOMAIN 


98 


113 


HINGE . 


FT 


DOMAIN 


114 


223 


CH2 . 


FT 


DOMAIN 


224 


327 


CH3 . 


FT 


TRANSMEM 


346 


362 


POTENTIAL. 


FT 


DOMAIN 


363 


398 


CYTOPLASMIC (POTENTIAL) . 


FT 


CONFLICT 


333 


333 


E -> G (IN REF. 2) . 


FT 


CONFLICT 


342 


342 


E -> Q (IN REF. 2) . 


FT 


CONFLICT 


388 


388 


P -> F (IN REF. 2) . 


SQ 


SEQUENCE 


398 AA; 


43929 


MW; CF7F264B50A41B95 CRC64; 


Query Match 




51. 4 s 


&; Score 845; DB 1; Length 



Best Local Similarity 61.4%; Pred. No. 1.7e-59; 

Matches 159; Conservative 37; Mismatches 51; Indels 12; Gaps 



Qy 

Db 

Qy 

Db 



52 SSNTPPLTCQRYCN ASVTNSVKGVD KTHTCP — PCPAPELLGGPSVFLFPPK 101 

II I I || III : I :: I II II : I II I I I 1 : I I I I 

71 SSTWPSQTV — ICNVAHPASKTELIKRIEPRIPKPSTPPGSSCPPGNILGGPSVFIFPPK 12 8 

102 PKDTLMISRTPEVTCVWDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRWSVL 161 

III I I I I 11:111111111 : I I : I : I : I I III I 1 = 111 I I I I I : I I I I I 

12 9 P KDALMI S LT P KVT CVWDVS E D D P DVHVS W FVDNKE VHT AWT Q P REAQ YN S T FRWS AL 18 8 



Qy 

Db 

Qy 

Db 



162 TVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT 221 

: Mil: I I I : I I I I : I I I I I I I I I : I I I I I I : : I II I I : II I : : : : I HIM 

189 PIQHQDWMRGKEFKCKVNNKALPAPIERTISKPKGRAQTPQVYTIPPPREQMSKKKVSLT 248 

222 CLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCS 281 

III I : I : I II I I I : I : M II I : I M I I : : I I I I II II I I I I M : I I 

249 CLVTNFFSEAISVEWERNGELEQDYKNTPPILDSDGTYFLYSKLTVDTDSWLQGEIFTCS 30 8 



Qy 

Db 



282 VMHEALHNHYTQKSLSLSP 300 

I : I II II II : I I I : I I II 
309 WHEALHNHHTQKNLSRSP 327 



RESULT 10 
GC1_RAT 

ID GC1_RAT STANDARD; PRT; 326 AA. 

AC P20759; 

DT 01-FEB-1991 (Rel. 17, Created) 

DT 01-FEB-1991 (Rel. 17, Last sequence update) 

DT 15-JUL-1999 (Rel. 38, Last annotation update) 

DE Ig gamma-1 chain C region. 

OS Rattus norvegicus (Rat) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Rattus. 

OX NCBI_TaxID=10116; 

RN [1] 



RP 


SEQUENCE 


FROM N.A. 










RX 


MEDLINE-8 


9232738; 


PubMed=3149946; 






RA 


Brueggemann M. ; 










RT 


"Evolution of the 


rat immunoglobulin gamma heavy-chain gene family 


RL 


Gene 74:473-482 (19 


88) . 








DR 


PIR; PS0017; PS0017. 








DR 


HSSP; P01 


842; 7 FAB 










DR 


InterPro; 


IPR007110; Ig-like. 








DR 


InterPro; 


IPR003597; Ig_cl. 








DR 


InterPro; 


IPR003006; Ig MHC. 








DR 


Pfam; PF00047; ig; 


3. 








DR 


SMART; SM00407; IGcl; 2. 








DR 


PROSITE; 


PS50835; 


IG_LIKE; 3. 








DR 


PROSITE; 


PS00290; 


IG MHC; 1. 








KW 


Immunoglobulin domain; Immunoglobulin C 


region; 


Glycoprotein . 


FT 


NON_TER 


1 


1 








FT 


DOMAIN 


1 


97 


CHI. 






FT 


DOMAIN 


98 


112 


HINGE. 






FT 


DOMAIN 


113 


219 


CH2 . 






FT 


DOMAIN 


220 


326 


CH3. 






FT 


DISULFID 


27 


82 








FT 


DISULFID 


102 


102 


INTERCHAIN 


(WITH A 


HEAVY CHAIN) . 


FT 


DISULFID 


106 


106 


INTERCHAIN 


(WITH A 


HEAVY CHAIN) . 


FT 


DISULFID 


109 


109 


INTERCHAIN 


(WITH A 


HEAVY CHAIN) . 


FT 


DISULFID 


111 


111 


INTERCHAIN 


(WITH A 


HEAVY CHAIN) . 


FT 


DISULFID 


140 


200 








FT 


DISULFID 


246 


304 








FT 


CARBOHYD 


176 


176 


N-LINKED (GLCNAC . . 


. ) (POTENTIAL) . 


SQ 


SEQUENCE 


326 AA; 


35946 MW; 


013BAB45EF49B9DA 


CRC64; 



Query Match 50.3%; Score 827; DB 1; Length 326; 

Best Local Similarity 58.2%; Pred. No. 3.5e-58; 

Matches 152; Conservative 42; Mismatches 51; Indels 16; Gaps 4 

Qy 52 SSNTPPLTCQRYCNASVTNSVKGVDKT HTCPPCPAPELLGG PSVFLFPPK 101 

I I I I II: I III III : I 111:1111 

Db 72 SSTWPSQTVT — CNVAHPASSTKVDKKIVPRNCGGDCKPC ICTGSEVSSVFI FPPK 125 



Qy 102 P KDT LMI S RT P E VT CVWDVS H E D P EVK FNW YVD GVE VHN AKT KP REEQ YN S T YRWS VL 161 

III I I : I I : I I I I I II : I : I I I I I : I : II I I I I I : I : I I II : I I I : I II I 

Db 12 6 PKDVLTITLTPKVTCVWDISQDDPEVHFSWFVDDVEVHTAQTRPPEEQFNSTFRSVSEL 185 

Qy 162 TVLHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLT 221 

: I I I I I I I I : : : I I I : : I I : I I I I I I II : I : : I III: I : : : ! : I : I :• I I : I 

Db 186 P I LHQDWLNGRTFRCKVT SAAFP S PI EKT I S KPEGRTQVPHVYTMS PTKEEMTQNEVS I T 245 

Qy 222 CLVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCS 281 

I : I I I I I I II III: MM: III III : I : I I I : I I I I II I I MINI I : I I 

Db 246 CMVKGFYPPDIYVEWQMNGQPQENYKNTPPTMDTDGSYFLYSKLNVKKEKWQQGNTFTCS 305 



Qy 282 VMHEALHNHYTQKSLSLSPGK 302 

I : I I I I I I : I : I I I I II I I 
Db 306 VLHEGLHNHHTEKSLSHSPGK 326 



RESULT 11 
GCC RAT 



ID GCC_RAT STANDARD; PRT; 329 AA. 

AC P2 07 62; 

DT 01-FEB-1991 (Rel. 17, Created) 

DT 01-FEB-1991 (Rel. 17, Last sequence update) 

DT 15-JUL-1999 (Rel. 38, Last annotation update) 

DE Ig gamma-2C chain C region. 

OS Rattus norvegicus (Rat) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Rattus. 

OX NCBI_TaxID=10116; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=8 8 166903; PubMed=312 7222 ; 

RA Brueggemann M. , Delmas tro-Galf re P., Waldmann H., Calabi F. ; 

RT "Sequence of a rat immunoglobulin gamma 2c heavy chain constant 

RT region cDNA: extensive homology to mouse gamma 3 . " ; 

RL Eur. J. Immunol. 18:317-319(1988). 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; X07189; CAA30169.1; -. 

DR PIR; S00847; S00847. 

DR HSSP; P01842; 7 FAB . 

DR InterPro; IPR007110; Ig-like. 

DR InterPro; IPR003597; Ig_cl. 

DR InterPro; IPR003006; Ig_MHC. 

DR Pfam; PF00047; ig; 2. 

DR SMART; SM00407; IGcl; 2. 

DR PROSITE; PS50835; IG_LIKE; 3. 

DR PROSITE; PS00290; IG_MHC; 1. 

KW Immunoglobulin domain; Immunoglobulin C region. 



FT 


N0N_TER 


1 


1 










FT 


DOMAIN 


1 


97 


CHI. 








FT 


DOMAIN 


98 


113 


HINGE. 








FT 


DOMAIN 


114 


222 


CH2 . 








FT 


DOMAIN 


223 


329 


CH3 . 








FT 


DISULFID 


15 


15 


INTERCHAIN 


(WITH A 


LIGHT 


CHAIN) 


FT 


DISULFID 


27 


82 










FT 


DISULFID 


111 


111 


INTERCHAIN 


(WITH A 


HEAVY 


CHAIN) 


FT 


DISULFID 


113 


113 


INTERCHAIN 


(WITH A 


HEAVY 


CHAIN) 


FT 


DISULFID 


143 


203 










FT 


DISULFID 


249 


307 










SQ 


SEQUENCE 


329 AA; 


36571 


MW; 5FCD7B7 933 850773 


CRC64; 





Query Match 50.3%; Score 826; DB 1; Length 329; 

Best Local Similarity 51.6%; Pred. No. 4.2e-58; 

Matches 165; Conservative 51; Mismatches 70; Indels 34; Gaps 7; 

Qy 15 VPGSTGDVTMLQMAGQCSQNEYF DSLLHACI PCQLR -CSSNT 55 

Ml * I I II II I : I II: Ml 

Db 12 VPGCSGTSGSLVTLG-CLVKGYFPEPVTVKWNSGALSSGVHT-FPAVLQSGLYTLSSSVT 69 



Qy 56 PP LTCQRYCNASVTNSVKGVD — KTHTCPP CPAPELLGGPSVFLFPPKP 102 

I : I I | : : | : | : : : I I I : I I I I I I : I I I I I 

Db 7 0 VPSSTWSSQTWCSVAHPATKSNLIKRIEPRRPKPRPPTDICSCDDNLGRPSVFIFPPKP 12 9 

Qy 103 KDTLMISRTPEVTCVWDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRWSVLT 162 



Db 130 KDILMITLTPKVTCVWDVSEEEPDVQFSWFVDNVRVFTAQTQPHEEQLNGTFRWSTLH 189 

Qy 163 VLHQDWLNGKE YKCKVSNKALPAP IEKT I S KAKGQPREPQVYTLP P S RDELTKNQVS LTC 222 

: I I I I :: I I I : I I I I : I I I I : I I I I I I I I : I : I I I I I I : II I : : : : I i : I I I I I 
Db 190 IQHQDWMSGKEFKCKVNNKDLPSPIEKTISKPRGKARTPQVYTIPPPREQMSKNKVSLTC 249 

Qy 223 LVKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSV 282 

: I III: I : I I I I I I : I : I I I I I I I I I I : I I I I I I : I I I : I : : : : I I I 
Db 250 MVTSFYPASISVEWERNGELEQDYKNTLPVLDSDESYFLYSKLSVDTDSWMRGDIYTCSV 309 

Qy 283 MHEALHNHYTQKSLSLSPGK 302 

: I I I I I I I : I I I : I I MM 
Db 310 VHEALHNHHTQKNLSRS PGK 329 



RESULT 12 
GC1_M0USE 

ID GC1_M0USE STANDARD; PRT; 32 4 AA. 

AC P01868; 

DT 21-JUL-1986 (Rel. 01, Created) 

DT 21-JUL-1986 (Rel. 01, Last sequence update) 

DT 15-MAR-2004 (Rel. 43, Last annotation update) 

DE Ig gamma-1 chain C region secreted form. 

OS Mus mus cuius (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=8 0 045036; PubMed=115593 ; 

RA Honjo T., Obata M. , Yamawaki-Kataoka Y. , Kataoka T., Kawakami T., 

RA Takahashi N., Mano Y. ; 

RT "Cloning and complete nucleotide sequence of mouse immunoglobulin 

RT gamma 1 chain gene."; 

RL Cell 18:559-568(1979). 

RN [2] 

RP SEQUENCE OF 76-324 FROM N.A. (MYELOMA PROTEIN MOPC 31C) . 

RX MEDLINE=80202559; PubMed=67 697 52 ; 

RA Obata M. , Yamawaki-Kataoka Y., Takahashi N., Kataoka T., Shimizu A., 

RA Mano Y. , Seidman J.G., Peterlin B.M. f Leder P., Honjo T.; 

RT "Immunoglobulin gamma 1 heavy chain gene: structural gene sequences 

RT cloned in a bacterial plasmid."; 

RL Gene 9 : 8 7- 97 ( 1 98 0 ) . 

RN [3] 

RP SEQUENCE OF 70-322 FROM N.A. (MYELOMA PROTEIN MOPC 21) . 

RX MEDLINE=80012837; PubMed-1 1377 6 ; 

RA Rogers J., Clarke P., Salser W.; 

RT "Sequence analysis of cloned cDNA encoding part of an immunoglobulin 

RT heavy chain. "; 

RL Nucleic Acids Res. 6:3305-3321(1979). 



RN [4] 

RP SEQUENCE (MYELOMA PROTEIN MOPC 21) . 

RX MEDLINE=78242288; PubMed=98524 ; 

RA Adetugbo K. ; 

RT "Evolution of immunoglobulin subclasses. Primary structure of a 

RT murine myeloma gammal chain. 1 ' ; 

RL J. Biol. Chem. 253:6068-6075(197 8). 

RN [5] 

RP DISULFIDE BONDS (MOPC 21). 

RX MEDLINE-73008889; PubMed=5073237 ; 

RA Svasti J., Milstein C; 

RT "The disulphide bridges of a mouse immunoglobulin Gl protein."; 

RL Biochem. J. 126:837-850(1972). 

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- ALTERNATIVE PRODUCTS: 

CC Event-Alternative splicing; Named isoforms=2; 

CC Name^Secreted; 

CC IsoId=P01868-l; Sequence=Displayed; 

CC Note=May be the major isoform; 

CC Name=Membrane-bound; 

CC IsoId=P018 69-l; Sequence=External ; 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; V00793; CAA24172.1 

DR EMBL; V00793; CAA24173.1 

DR EMBL; V00793; CAA24174.1 

DR EMBL; V00793; CAA24175.1 

DR EMBL; V00795; CAA24176.1, 

DR PIR; A02159; G1MS . 

DR GlycoSuiteDB; P01868; -. 

DR MGD; MGI: 96446; Igh-4. 

DR InterPro; IPR007110; Ig-like. 

DR InterPro; IPR003597; Ig_cl. 

DR InterPro; IPR003006; Ig_MHC. 

DR Pfam; PF00047; ig; 2. 

DR SMART; SM00407; IGcl; 2. 

DR PROSITE; PS50835; IG_LIKE; 3. 

DR PROSITE; PS00290; IG_MHC; 1. 

KW Immunoglobulin domain; Immunoglobulin C 



region; Glycoprotein; 



KW 


Alternative 


splicing . 




FT 


NONJTER 


1 


1 




FT 


DOMAIN 


1 


97 


CHI. 


FT 


DOMAIN 


98 


110 


HINGE. 


FT 


DOMAIN 


111 


217 


CH2 . 


FT 


DOMAIN 


218 


324 


CH3. 


FT 


DISULFID 


27 


82 




FT 


DISULFID 


102 


102 


INTERCHAIN 


FT 


DISULFID 


104 


104 


INTERCHAIN 


FT 


DISULFID 


107 


107 


INTERCHAIN 


FT 


DISULFID 


109 


109 


INTERCHAIN 



(WITH A LIGHT CHAIN) . 

(WITH A HEAVY CHAIN) . 

(WITH A HEAVY CHAIN) . 

(WITH A HEAVY CHAIN) . 



FT 


DISULFID 


138 


198 




FT 


CARBOHYD 


174 


174 


N-LINKED (GLCNAC. . .). 


FT 








/FTId=CAR__000055. 


FT 


DISULFID 


244 


302 




FT 


MOD RES 


324 


324 


REMOVED POST-TRANSLATIONALLY 


FT 


CONFLICT 


276 


276 


N -> D (IN REF. 3) . 


FT 


CONFLICT 


278 


278 


N -> D (IN REF. 3) . 


SQ 


SEQUENCE 


324 AA; 


35704 


MW; A338812F3D1F2C93 CRC64; 



Query Match 50.2%; Score 825.5; DB 1; Length 324; 

Best Local Similarity 51.1%; Pred. No. 4.5e-58; 

Matches 162; Conservative 49; Mismatches 71; Indels 35; Gaps 



Qy 

Db 



16 PGSTGDVTMLQMAGQCSQNEYF DSLLHACIPCQLR CSSNTP 56 

I II : I I II I : I I I : III 

13 PGSAAQTNSMVTLG-CLVKGYFPEPVTVTWNSGSLSSGVHT-FPAVLQSDLYTLSSSVTV 70 



Qy 



Db 



57 P LTCQRYCNASVTN-SVKGVDKTHTCPP--CPAPELLGGPSVFLFPPKPKDT 105 

I : I I Mill: Mill: I I I : II I I I I I 

71 PSSPRPSETVTCNVAHPASSTKVDKKIVPRDCGCKPCICTVPEV SSVFIFPPKPKDV 127 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



106 LM I S RT P E VT CVWDVS H E D P E VK FNW YVD GVE VHN AKT K P RE EQ YN STY R WS VLT VL H 165 

I I : I I : I I I I II I : I : I I I I : I : I : I I Mil I : I : I I I I I : I I I : I II I : : I 
128 LTITLTPKVTCVWDISKDDPEVQFSWFVDDVEVHTAQTQPREEQFNSTFRSVSELPIMH 187 

166 QDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVK 225 

I I I I I I II : II : I : : I I II I I I II I I I I : I : I I I I I : I I : : : : I : : I I II I : : 
188 QDWLNGKEFKCRVNSAAFPAPIEKTISKTKGRPKAPQVYTIPPPKEQMAKDKVSLTCMIT 247 

226 GFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHE 2 85 

Ml II MM I II I III I M : : : : I M M I I II I II M II I : I I M I I 
248 DFFPEDITVEWQWNGQPAENYKNTQPIMNTNGSYFVYSKLNVQKSNWEAGNTFTCSVLHE 307 

286 ALHNHYTQKSLSLSPGK 302 

I I I I : M I I I I I II I 
308 GLHNHHTEKSLSHSPGK 324 



RESULT 13 




GC 1M_ 


_MOUSE 




ID 


GC1M MOUSE STANDARD; PRT; 393 AA. 




AC 


P01869; 




DT 


21-JUL-1986 (Rel. 01, Created) 




DT 


01-AUG-1991 (Rel. 19, Last sequence update) 




DT 


10-OCT-2003 (Rel. 42, Last annotation update) 




DE 


Ig gamma-1 chain C region, membrane-bound form. 




OS 


Mus mus cuius (Mouse) . 




OC 


Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; 


Euteleostomi ; 


OC 


Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus 


OX 


NCBI TaxID-10090; 




RN 


[1] 




RP 


SEQUENCE FROM N.A. 




RX 


MEDLINE=8 0045036; PubMed=115593; 




RA 


Honjo T., Obata M. , Yamawaki-Kataoka Y., Kataoka T. 


, Kawakami T. , 


RA 


Takahashi N., Mano Y. ; 




RT 


"Cloning and complete nucleotide sequence of mouse 


immunoglobulin 


RT 


gamma 1 chain gene.";- 





RL Cell 18:559-568(1979). 

RN [2] 

RP SEQUENCE OF 323-393 FROM N.A. 

RX MEDLINE=82197 626; PubMed=68 04 950 ; 

RA Tyler B.M., Cowman A.F., Gerondakis S.D., Adams J.M., Bernard O.; 

RT "mRNA for surface immunoglobulin gamma chains encodes a highly 

RT conserved transmembrane sequence and a 28-residue intracellular 

RT domain . " ; 

RL Proc. Natl. Acad. Sci. U.S.A. 79:2008-2012(1982). 

RN [3] 

RP SEQUENCE OF 323-366 FROM N.A. 

RX MEDLINE=821152 95; PubMed=67 9 92 07 ; 

RA Rogers J., Choi E . , Souza L . , Carter C, Word C.J., Kuehl M., 

RA Eisenberg D . f Wall R. ; 

RT "Gene segments encoding transmembrane carboxyl termini of 

RT immunoglobulin gamma chains."; 

RL Cell 26:19-27(1981). 

RN [4] 

RP SEQUENCE OF 1-44 FROM N.A. 

RX MEDLINE=82222190; PubMed=62 8 3537 ; 

RA Yamawaki-Kataoka Y., Nakai S., Miyata T . , Hon jo T.; 

RT "Nucleotide sequences of gene segments encoding membrane domains of 

RT immunoglobulin gamma chains."; 

RL Proc. Natl. Acad. Sci. U.S.A. 79:2 623-2627(1982), 

CC -!- ALTERNATIVE PRODUCTS: 

CC Event=Alternative splicing; Named isof orms=2 ; 

CC Name=Membrane-bound; 

CC IsoId=P018 69-l; Sequence^Displayed; 

CC Name=Secreted; 

CC IsoId=P018 68-l; Sequence=External ; 

CC Note=May be the major isoform; 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; V00793; CAA24172.1; 

DR EMBL; V00793; CAA24173.1; -. 

DR EMBL; V00793; CAA24174.1; ~. 

DR PIR; B02159; G1MSM. 

DR PDB; 15C8; 23-MAR-99. 

DR PDB; 1AE6; 18-MAR-98. 

DR PDB; 1CL7; 12-JAN-00. 

DR PDB; 1F11; 06-FEB-01. 

DR PDB; 1F58; 29-DEC-99. 

DR PDB; 1KC5; 24-JUL-02. 

DR PDB; 1KCR; ll-MAY-02. 

DR PDB; 25C8; 09-JUL-99. 

DR MGD; MGI: 96446; Igh-4 . 

DR InterPro; IPR007110; Ig-like. 

DR InterPro; IPR003597; Ig_cl. 

DR InterPro; IPR003006; Ig_MHC. 

DR Pfam; PF00047; ig; 2. 



DR 


SMART; SM00407; IGcl; 2 






DR 


PROSITE; 


PS50835; 


IG_LIKE; 3. 




DR 


PROSITE; 


PS00290; 


IG MHC; 1. 




KW 


Immunoglobulin domain; 


Immunoglobulin C region; Glycoprotein; 


KW 


Alternative splicing; Transmembrane; 3D-structure . 


FT 


NON_TER 


1 


1 






FT 


DOMAIN 


1 


97 




CHI. 


FT 


DOMAIN 


98 


110 




HINGE . 


FT 


DOMAIN 


111 


217 




CH2 . 


FT 


DOMAIN 


218 


324 




CH3. 


FT 


DT SUT.FT D 

LJ J, iJ U J_ J_ 


27 


82 






FT 


DISULFID 


102 


102 




INTERCHAIN (WITH A LIGHT CHAIN) . 


FT 


DISULFID 


104 


104 




INTERCHAIN (WITH A HEAVY CHAIN) . 


FT 


DISULFID 


107 


107 




INTERCHAIN (WITH A HEAVY CHAIN) . 


FT 


DISULFID 


109 


109 




INTERCHAIN (WITH A HEAVY CHAIN) . 


FT 


DISULFID 


138 


198 






FT 


CARBOHYD 


174 


174 




N-LINKED (GLCNAC. . .). 


FT 


DISULFID 


244 


302 






FT 


TRANSMEM 


340 


357 




POTENTIAL. 


FT 


DOMAIN 


358 


393 




CYTOPLASMIC (POTENTIAL) . 


SQ 


SEQUENCE 


393 AA; 


433 


86 MW; 


4CC88343B7A1CE27 CRC64; 


Query Match 




49 


.9%; 


Score 820.5; DB 1; Length 393; 


Best Local Similarity 


50 


.9%; 


Pred. No. 1.4e-57; 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



Matches 161 ; Conservative 49; Mismatches 71 ; Indels 35 ; Gaps 8 ; 

16 PGSTGDVTMLQMAGQCSQNEYF DSLLHACIPCQLR CSSNTP 56 

III : I I II I : I I I : III 

13 PGSAAQTNSMVTLG-CLVKGYFPEPVTVTWNSGSLSSGVHT-FPAVLQSDLYTLSSSVTV 70 

57 P LTCQRYCNASVTN-SVKGVDKTHTCPP — CPAPELLGGPSVFLFPPKPKDT 105 

I Ml III II: II I Ih I I I : II I I I I I 

71 PSSPRPSETVTCNVAHPASSTKVDKKIVPRDCGCKPCICTVPEV SSVFIFPPKPKDV 127 



106 



128 



166 



188 



226 



248 



LMISRTPEVTCVWDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRWSVLTVLH 165 
I I : I I : I I I I 1 I I : I : I I I I : I : I : I I I I I I I : I : I I I I I : I I I : I II I : : I 
LTITLTPKVTCVWDISKDDPEVQFSWFVDDVEVHTAQTQPREEQFNSTFRSVSELPIMH 187 

QDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVK 225 
I I I I I I II : I I : I : : I I I I I I I I I I I I I : I : I I I I I : I I : : : : I : : I I I I I : : 
QDWLNGKEFKCRVNSAAFPAPIEKTISKTKGRPKAPQVYTIPPPKEQMAKDKVSLTCMIT 24 7 



GFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHE 
I : I II III: MM III I I : : : : : I I : I : I II I I II I : II I : I I I : I I 
DFFPEDITVEWQWNGQPAENYKNTQPIMNTNGSYFVYSKLNVQKSNWEAGNTFTCSVLHE 



285 



307 



2 86 ALHNHYTQKSLSLSPG 301 

I I I I : I : I II I III 
308 GLHNHHT EKSLSHSPG 323 



RESULT 14 
GCAA_MOUSE 

ID GCAA_MOUSE STANDARD; PRT; 330 AA. 

AC P01863; 

DT 21-JUL-1986 (Rel. 01, Created) 

DT 21-JUL-1986 (Rel. 01, Last sequence update) 



DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Ig gamma-2A chain C region, A allele. 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=81076554; PubMed=6777 755 ; 

RA Sikorav J.-L., Auffray C, Rougeon F. ; 

RT "Structure of the constant and 3' untranslated regions of the murine 

RT Balb/c gamma 2a heavy chain messenger RNA. " ; 

RL Nucleic Acids Res. 8:3143-3155(198 0). 

RN [2] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=81198976; PubMed=6262729 ; 

RA Yamawaki-Kataoka Y. , Miyata T. , Hon jo T . ; 

RT "The complete nucleotide sequence of mouse immunoglobin gamma 2a gene 

RT and evolution of heavy chain genes : further evidence for intervening 

RT sequence-mediated domain transfer. "; 

RL Nucleic Acids Res. 9:1365-1381(1981). 

RN [3] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=81223894; PubMed=6787604 ; 

RA Olio R. , Auffray C, Morchamps C, Rougeon F. ; 

RT "Comparison of mouse immunoglobulin gamma 2a and gamma 2b chain genes 

RT suggests that exons can be exchanged between genes in a multigenic 

RT family. " ; 

RL Proc. Natl. Acad. Sci. U.S.A. 7 8:2442-2446(1981). 

RN [4] 

RP MYELOMA PROTEIN MOPC 173. 

RX MEDLINE=74175517; PubMed=4 831 97 0 ; 

RA Bourgois A., Fougereau M. , Rocca-Serra J.; 

RT "Determination of the primary structure of a mouse IgG2a 

RT immunoglobulin : amino-acid sequence of the Fc fragment. Implications 

RT for the evolution of immunoglobulin structure and function."; 

RL Eur. J. Biochem. 43:423-435(1974). 

RN [5] 

RP DISULFIDE BONDS. 

RX MEDLINE=73056887 ; PubMed=4 5 654 06 ; 

RA de Preval C, Fougereau M. ; 

RT "Determination of the primary structure of a mouse gamma G2a 

RT immunoglobulin. Identification of the disulfide bridges."; 

RL Eur. J. Biochem. 30:452-4 62(1972). 

CC -!- SIMILARITY: Contains 3 immunoglobulin-like domains. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; V00798; CAA24178.1; -. 

DR PIR; A02152; G2MSA. 

DR PDB; 1E4W; 12-JUL-01. 



DR 


PDB; 1E4X; 12-JUL- 


01. 








DR 


PDB; 1MNU; 06-MAY- 


99. 








DR 


InterPro; 


IPR007110; Ig 


-like. 






DR 


InterPro; 


IPR003597; Ig 


cl. 






DR 


InterPro; 


IPR003006; Ig 


_MHC. 






DR 


Pfam; PF00047; ig; 


2. 








DR 


SMART; SM00407; IGcl; 2 








DR 


PROSITE; 


PS50835; 


IG_LIKE; 3. 






DR 


PROSITE; 


PS00290; 


IG MHC; 1. 






KW 


Immunoglobulin domain; 


Immunoglobulin C 


region; 3D-structure 


FT 


NON__TER 


1 


1 








FT 


DOMAIN 


6 


98 




IG-LIKE 1. 




FT 


DOMAIN 


121 


220 




IG-LIKE 2. 




FT 


DOMAIN 


229 


325 




IG-LIKE 3. 




FT 


DISULFID 


15 


15 




INTERCHAIN 


(WITH A LIGHT CHAIN) 


FT 


DISULFID 


27 


82 








FT 


DISULFID 


107 


107 




INTERCHAIN 


(WITH A HEAVY CHAIN) 


FT 


DISULFID 


110 


110 




INTERCHAIN 


(WITH A HEAVY CHAIN) 


FT 


DISULFID 


112 


112 




INTERCHAIN 


(WITH A HEAVY CHAIN) 


FT 


DISULFID 


144 


204 








FT 


DISULFID 


250 


308 








FT 


MOD_RES 


330 


330 




REMOVED POST-TRANSLATIONALLY. 


SQ 


SEQUENCE 


330 AA; 


363 


89 MW; 


B84361C5445A6864 CRC64; 



Repeat . 



Query Match 49.9%; Score 819.5; DB 1; Length 330; 

Best Local Similarity 54.8%; Pred. No. 1.4e-57; 

Matches 161; Conservative 43; Mismatches 77; Indels 13; 



Gaps 



Qy 

Db 



22 VTMLQMAGQCSQNEY-FDSLLHA CI PCQLRCSSNTPP LTCQRYCNASVTNSVKG 74 

II: : I I : I : : I : : : : I : I I : I I Ml I 

37 VTLTWNSGSLSSGVHTFPAVLQSDLYTLSSSVTVTSSTWPSQSITCNVAHPASSTKVDKK 9 6 



QY 
Db 



75 VD KTHTCPP — CPAPELLGGPSVFLFPPKPKDTLMI SRTPEVTCVWDVSHEDPEV 128 

: : III II I I I I I I I I I I : I I I I I I I I II : I I I I I I I I I I : I I : I 

97 IEPRGPTIKPCPPCKCPAPNLLGGPSVFIFPPKIKDVLMISLSPIVTCVWDVSEDDPDV 156 



Qy 

Db 

Qy 

Db 

Qy 

Db 



129 KFNW WDGVEVHNAKT KP REEQ YNST YRVVS VLTVLHQDWLNGKE YKCKVSNKAL PAP I E 18 8 

: : I : I : I I I I I : I : I I I I I I I I I I : I I I I : : I I I : I I I I : I I MINI 
157 QISWFVNNVEVHTAQTQTHREDYNSTLRWSALPIQHQDWMSGKEFKCKVNNKDLPAPIE 216 



189 



248 



KTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQPENNYKT 
: I II I II I I I I I I I I : I : I I I I : M I : I I I I I I I I : I I : I I I I 
217 RTISKPKGSVRAPQVYVLPPPEEEMTKKQVTLTCMVTDFMPEDIYVEWTNNGKTELNYKN 276 

249 TPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKSLSLSPGK 302 

I I I I I I I I I : I : I I I I I : I I : I : I I I I : I I I I I I : I II I : I I I 
277 TEPVLDSDGSYFMYSKLRVEKKNWVERNSYSCSWHEGLHNHHTTKSFSRTPGK 330 



RESULT 15 
GCAB_MOUSE 

ID GCAB_MOUSE STANDARD; PRT; 335 AA. 

AC P01864; 

DT 21-JUL-1986 (Rel. 01, Created) 

DT 21-JUL-1986 (Rel. 01, Last sequence update) 

DT 15-MAR-2004 (Rel. 43, Last annotation update) 

DE Ig gamma-2A chain C region secreted form (B allele) . 



OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=C57BL/6; 

RX MEDLINE=82037861; PubMed=617 00 65 ; 

RA Schreier P.H., Bothwell A.L.M., Mueller-Hill B., Baltimore D.; 

RT "Multiple differences between the nucleic acid sequences of the 

RT IgG2aa and IgG2ab alleles of the mouse."; 

RL Proc. Natl. Acad. Sci. U.S.A. 78:4495-4 4 99(1981). 

RN [2] 

RP SEQUENCE. 

RX MEDLINE-82037777; PubMed=67 94027 ; 

RA Dognin M.J., Lauwereys M. , Strosberg A.D.; 

RT "Multiple amino acid substitutions between murine gamma 2a heavy 

RT chain Fc regions of Igla and Iglb allotypic forms."; 

RL Proc. Natl. Acad. Sci. U.S.A. 78:4031-4035(1981). 

CC -!- SUBCELLULAR LOCATION: Secreted (Potential). 

CC -!- ALTERNATIVE PRODUCTS: 

CC Event=Alternative splicing; Named isoforms=2; 

CC Name=Secreted; 

CC IsoId=P018 64-l; Sequence=Displayed; 

CC Note=Probably the major isoform; 

CC Name— Membrane-bound; 

CC IsoId=P01865-l; Sequence=External ; 

CC -!- MISCELLANEOUS: The sequence differs from that of the a allele, 

CC from BALB/c mice, at 15% of the positions. 

CC -!- SIMILARITY: Contains 3 immunoglobulin-like domains. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; J00479; -; NOT_ANNOTATED_CDS . 

DR PIR; A02153; G2MSAB. 

DR PDB; 1BOG; 23-MAR-99. 

DR PDB; 1HH6; 26-JAN-01. 

DR PDB; 1HH9; 24-JUL-03. 

DR PDB; 1HI6; 08-FEB-01. 

DR InterPro; IPR007110; Ig-like. 

DR InterPro; IPR003597; Ig_cl. 

DR InterPro; IPR003006; Ig_MHC. 

DR Pfam; PF00047; ig; 2. 

DR SMART; SM004 07; IGcl; 2. 

DR PROSITE; PS50835; IGJLIKE; 3. 

DR PROSITE; PS00290; IG_MHC; 1. 

KW Immunoglobulin domain; Immunoglobulin C region; Alternative splicing; 

KW 3D-structure; Repeat. 

FT NON_TER 1 1 

FT DOMAIN 6 98 IG-LIKE 1. 

FT DOMAIN 126 225 IG-LIKE 2. 



FT DOMAIN 234 330 IG-LIKE 3. 

SQ SEQUENCE 335 AA; 36596 MW; FA3382792CBB13C6 CRC64; 



Query Match 49.8%; Score 818.5; DB 1; Length 335; 

Best Local Similarity 50.8%; Pred. No. 1.7e-57; 

Matches 162; Conservative 51; Mismatches 73; Indels 33; Gaps 6 

Qy 17 GSTGD VTMLQMAGQCSQNEY-FDSLLHA CIPCQLRCSSNTPP 57 

I : I I II: : I I : I : I I : : : : || I I 

Db 17 GTTGSSVTLGCLVKGYFPEPVTLTWNSGSLSSGVHTFPALLQSGLYTLSSSVTVTSNTWP 7 6 

Qy 58 LT CQRYCNAS VTN S VKGVD KTHTC PPCPAPELLGGPSVFLFPPKPK 103 

: I I III I : : : I I I I I I : I I I I II I I : I II I I 

Db 77 SQTITCNVAHPASSTKVDKKIEPRVPITQNPCPPHQRVPPCAAPDLLGGPSVFIFPPKIK 136 

Qy 104 DTLMISRTPEVTCWVDVSHEDPEVKFNWYVDGVEVHNAKTKPREEQYNSTYRWSVLTV 163 

I MM : I I II I I I M I M I : I : : I : I : MM MM I MM II II I : 

Db 137 DVLMISLSPMVTCVWDVSEDDPDVQISWFVNNVEVHTAQTQTHREDYNSTLRWSALPI 196 

Qy 164 LHQDWLNGKEYKCKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCL 223 

I II I : M I I M II I M M I I M I I I I I I I : I I I I I I III M : I I : MM: 
Db 197 QHQDWMSGKEFKCKVNNRALPSPIEKTISKPRGPVRAPQVYVLPPPAEEMTKKEFSLTCM 256 

Qy 224 VKGFYPSDIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVM 283 

: II I : : I I I : I MM I Ml I II II I I I : M I I I I I II I : : I : : I : I II : 
Db 257 ITGFLPAEIAVDWTSNGRTEQNYKNTATVLDSDGSYFMYSKLRVQKSTWERGSLFACSW 316 

Qy 284 HEALHNHYTQKSLSLSPGK 302 

II I I I I I MM I M 

Db 317 HEVLHNHLTTKTISRSLGK 335 



Search completed: April 19, 2004, 13:21:31 
Job time : 13.428 sees 



